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INDUCIBLE ALPHAVIRAL/ORIP BASED GENE EXPRESSION SYSTEM 



BACKGROUND OF THE INVENTION 

5 Field of the Invention 

The present invention relates to compositions and methods for the production 
of polypeptides and/or untranslated RNA molecules in host cells. The invention 
provides nucleic acid molecules, expression systems, host cells, methods and kits 
that are useful for the production of polypeptides and/or untranslated RNA 
10 molecules. 

Related Art 

Process development for biopharmaceuticals is influenced by product quality 
and economy of the manufacturing process. The economic production of 

15 recombinant proteins in mammalian cells is dependent on the selection of the 
producing cell lines. The classical approach is the use of stable expression systems. 
These systems are based on chromosomal integration of an expression plasmid into 
the genome of the host cell. 

An alternative to stable expression systems is transient gene expression. For 

20 example, transient gene expression in, e.g., mammalian cells, at reactor scale is 
becoming increasingly important for the rapid production of recombinant proteins. 
Large scale applications of transient expression systems for recombinant protein 
production have been reported using COS, HEK 293 and BHK21 cells (Blasey et al., 
1996, Cytotechnology, 18, 138-192; Jordan et al., 1998, Cytotechnology, 26, 39-47; 

25 Wurm and Bemard, 1999, Curr Opin Biotechn 10, 156-159; Meissner et al., 2001, 
Biotechn and Bioeng, 75, 2, 197-203). 
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Transient expression can be performed by infection (using, e.g., viral vectors) 
or by transfection of the appropriate vector. Transfection can be performed with 
either a non-replicating or a replicating DNA vector or with a virus derived RNA 
vector. 

5 A variety of virus vectors for expression in higher eukaryotic cells have been 

developed by using recombinant DNA technology and introducing genes of interest 
into virus genomes. The initial viruses used in vector development were DNA 
vimses such as Adeno-associated viruses or Poxviruses as well as RNA viruses 
replicating through DNA intermediates such as Retroviruses. 

10 Episomal replicating vectors provide certain advantages over classical 

expression systems. Some DNA viruses, such as Adeno-, Papilloma-, Polyoma-, 
Hepadna-, and Herpesviruses, do not normally integrate into the host genome, but 
replicate episomally (extrachromosomally) in the nucleus of a host cells such as in a 
mammalian host cell. This process may involve both virus trans-acting factors and 

15 the host cell replication machinery. Episomal replicating vectors derived from these 
viruses generally contain a replication origin and at least one viral protein, e.g., an 
initiator protein. Examples of such initiator proteins are large T-antigen for SV40, 
E1/E2 for BPV, and EBNA-1 for EBV. 

Some host factors are able to recognize viral origins of replication and initiate 

20 replication. For example human S/MAR (scaffold/matrix attached region from the 
human interferon-beta gene) is able to maintain the episomal replication of a vector 
containing SV 40 origin of replication. On the other hand certain viral origins of 
replication (ORI) are recognized by corresponding specific virally-encoded proteins. 
For example, SV40 or polyoma virus ORI is recognized by large T antigen, BPV 

25 (Bovine papilloma virus) ORI is recognized by E1/E2 complex, EBV (Epstein-Barr 
virus) ORI is recognized by EBNAl protein, and yeast origin of replication (ARS 
element) is recognized by ORE (origin recognition element) replication (Bode et al., 
2001 Gene Ther Mol Biol, v6, 33-46). 

The double strand DNA genome of Epstein-Barr virus EBV (165 kb) is 

30 maintained in the latently infected cells as a large circular episome. Amplification of 
this episome is mediated by a cis-acting sequence such as the origin of replication 
(OriP), and one of the viral proteins, the Epstein-Barr viral nuclear antigen 1 
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(EBNAl). In a minimal system the presence of both OriP (1.8 kb region) and EBNA- 
1 is sufficient for replication and stabilization of recombinant plasmids. The exact 
function of EBNA-1 is still not clear, although the protein has a DNA binding 
domain. Generally EBNA-1 is involved in plasmid maintenance after completion of 
5 the synthesis and in plasmid segregation by dividing cells during proliferation. The 
OriP region is recognized also directly by the cellular DNA synthesizing machinery 
and is replicated during the cell cycle. 

Other gene expression systems are based on alphaviruses (Lundstrom, K., 
CurK Opin, Biotechnol 5:578-582 (1997)). Several members of the alphavirus 

10 family, e.g., Sindbis virus (Xiong, C. et al, Science 2^5:1188-1191 (1989); 
Schlesinger, S., Trends Biotechnol 77:18-22 (1993)), SFV (Liljestrom, P. & Garoff, 
H., Bio/Technology P:1356-1361 (1991)) and others (Davis, N.L. et aL, Virology 
777:189-204 (1989)) have received considerable attention for the use as virus-based 
expression vectors for a variety of different proteins (Lundstrom, K., Curr, Opin, 

15 Biotechnol <S:578-582 (1997); Liljestrom, P., Curr, Opin. Biotechnol 5:495-500 
(1994)). 

Alphaviruses are positive stranded RNA viruses which replicate their 
genomic RNA entirely in the cytoplasm of the infected cell and without a DNA 
intermediate (Strauss, J. and Strauss, E., Microbiol Rev, 55:491-562 (1994)). The 
20 concept that alphaviruses can be developed as expression vectors was first 
established nearly ten years ago (Xiong, C. et al, Science 2¥i: 1 188-1 191 (1989)). 
Since then, several improvements have made the use of these RNA replicons as 
expression vectors more practical (Lundstrom, K., Curr. Opin. Biotechnol <S:578-582 
(1997)). 

25 DNA vectors have been developed for both Sindbis virus (Herweijer, H. et 

al, Hum. Gene Ther. (5:1495-1501 (1995); Dubensky, T.W. et al, J. Virol 70:508- 
519 (1996)) and SFV (Berglund, P. et al. Trends Biotechnol 74:130-134 (1996)). 
Eukaryotic promoters are introduced in these vectors upstream from the alphavims 
replicase gene (consisting of the four non-structural protein genes (nsPl-4)), which 

30 are translated as one or two polyproteins which are then proteolytically cleaved. 
DNA is transcribed into RNA from the recombinant eukaryotic promoter in the 
nucleus and transported to the cytoplasm, where the replicase catalyzes the 




replication of the alphavirus RNA molecule as during normal replication of the 
alphavirus RNA molecule (Strauss, J. and Strauss, E., Microbiol Rev. 55:491-562 
(1994)). Only transient expression of heterologous sequences has been possible until 
recently due to the cytopathogenicity of the alphavirus replicase (Lundstrom, K., 
5 Curr. Opin. Biotechnol 5:578-582 (1997)). 

Among the features which make Alphavirus vectors attractive as transient 
expression systems are the easy construction of plasmid clones, a rapid gene 
expression (within 24h), a high level of expression, a wide host range of the virus 
(many insect, avian and mammalian cells), production of high-titered stocks of 

10 infectious particles and low risk of cell transformation due to the RNA nature of the 
genome and its replication entirely in the host cell cytoplasm without going through 
DNA intermediates. (Schlesinger S., 2000, Exp, Opin, Biol Then, 7, 2, UJ-lPl; 
Schlesinger S., Adv. Virus res., v55, 565-577; Garofif, H. and K,J. Li, 1998, Curr. 
Opin. Biotechnol, 9, 464-469; Lxmdstrom, K., 1997, Curr. Opin. Biotechnol, 8, 578- 

15 583; Xiong et al., 1989, Science, 243, 1188-1191). Blasey et al. compared the 
Semliki Forest Vims (SFV) expression system with the Epstein-Barr Virus (EBV) 
system (pCEP4 and HEK293EBNA cells) for expression of 5-HT3 receptors and 
foimd that both systems allow the production of functional recombinant protein 
within days to weeks at very high comparable levels (in: Animal Cell Technology, 

20 1999, pp331-337). 

One of the variants of alphavirus vectors is the replicon, which is capable of 
self-replication in the host cytoplasm after introduction into the cultured cells by 
transfection. In this replicon, viral structural protein genes have been replaced by a 
gene of interest (GOI). Such a replicon can be introduced into cells as RNA or cDNA 

25 if placed imder the control of an eukaryotic promoter or a promoter, which is 
recognized by the host cell machinery. (Agapov et al, Proc. Natl. Acad. Set USA 
95:12989-12994(1998)). 

RNA viruses have been engineered into chimeric viruses producing 
heterologous proteins. Such RNA virus vectors have been developed from 

30 Alphaviruses (with positive strand genome), Influenza and VSV (with negative 
strand genome). One of the major advantages of these viruses is their RNA-only 
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replication strategy performed entirely in the cell cytoplasm and eliminating the risk 
of genome integration. 

Adeno-, Adeno-associated and Retrovirus vectors can be used as gene 
therapy vectors and/or as expression vectors, especially for large-scale protein 
5 production. Herpesvirus derivative vectors containing a replication origin have been 
used for bioreactor protein production after transient transfection of a suitable cell 
line carrying a replication initiation gene. (Durocher et al.. Nucleic Acids Res, 
30(2):E9 (2002); Meissner et al, Biotechnol Bioeng. 75(2): 197-203 (2001)). In 
these experiments, 293 E cells (293 human embryo kidney cells expressing Epstein- 

10 Barr virus nuclear antigene - EBNA-1) were used which provided a replication 
initiation factor in trans to a plasmid carrying EBV origin of replication - OriP. 
Reporter genes (encoding secreted, intracellular or transmembrane proteins) were 
constitutively expressed over a period of several days. 

Hybrid vectors have also been reported, mainly as gene delivery systems or 

15 for use in virus biology research, such as combinations between EBV episomes and 
Herpes-, Adeno- (DNA) or Retro- (RNA-DNA) viruses (Sena-Esteves M. et al, 1999, 
Virol Dec;73(12):10426-39; Tan BT, et al.,1999, J Virol Sep; 73 (9): 7582-9; 
Grignani F. et al., 1998, Cancer Res Jan 1;58(1):14'9; Wang S. and Vos JM., 1996, 
J Virol Dec;70(12):8422-30). EBV episomes are nuclear plasmids that are stably 

20 maintained through multiple cell divisions in primate and canine cells (J. L. Yates, N. 
Warren, and B. Sugden, 1985, Nature 313:812-815). 

The ability to precisely control the expression of genes introduced into animal 
or himian cells, or in whole organisms, enables significant progress in many areas of 
biology and medicine. For instance, methods that allow the intentional manipulation 

25 of gene expression will facilitate the analysis of genes whose expression cannot be 
tolerated constitutively. 

To be of broad benefit, gene regulation techniques preferably allow for rapid, 
robust, precise and reversible induction of gene activity. As reviewed in Saez, E. et 
al, (Curr, Opin, Biotechnol 5:608-616 (1997)), an ideal system would fulfill the 

3 0 following requirements : 

1. Specificity ~ The system should be indifferent to endogenous factors 
and activated only by exogenous stimuli. 
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2. Non-interference ~ The components of the system should not affect 
unintended cellular pathways. 

3. Inducibility ~ In the inactive state, the basal activity of the system 
should be minimal, while in the active state high levels of gene expression should be 

5 rapidly inducible, 

4. Bioavailability of the inducer — Inducing stimuli should rapidly 
penetrate to the site of interest. 

5. Reversibility — Inducing stimuli should clear swiftly to allow the 
system to rapidly return to the inactive state. 

10 One common system currently in use for the regulation of gene expression is 

the tetracycline-based system (Gossen and Bujard, Proc. Natl Acad. Sci. USA 
89:5547 (1992)). This system is based on the continuous expression of a fiision 
protein with the tetracycline repressor protein (tetR) being converted into an activator 
by fusion to the transcriptional activation domain of the VP 16 protein. The tTA 

15 system is usefiil for, e.g,^ inducible gene expression and has been successfully used 
for the production of a number of proteins (Wimmel et al, Oncogene P:995 (1994); 
Friih et al, EMBO J. 75:3236 (1994); Yu et al, J. Virol 70:4530 (1996)). However, 
there are certain disadvantages associated with the tTA system. (Schocket et al, 
Proc. Natl Acad Sci, USA 92:6522 (1995); Howe et al, J, Biol Chem, 25:14168 

20 (1995); Schocket and Schatz, Proc. Natl Acad. Set USA 93:5173 (1996); Bohl et al, 
Nat. Med 5:299 (1997); Furth et al, Proc. Natl Acad Sci USA 91:9302 (1994); 
Hennighausen et al, J. Cell Biochem. 5P:463 (1995); Kistner et al, Proc. Natl 
Acad Sci USA P5: 10933 (1996); Hofl&nann et al. Nucleic Acids Res. 25:1078 
(1997)). 

25 Inducible expression systems based on viral expression vectors (e.g., 

aphaviral vectors) are described in WO 99/50432 (incorporated herein by reference 
in its entirety). An exemplary alphaviral vector is pCytTs (also known as CytTs; cf. 
WO 99/50432). Using these systems, it is possible to precisely induce and regulate 
gene expression. This precise regulation results from the use of a temperature- 

30 sensitive RNA-dependent RNA polymerase (i.e., a replicase), which only replicates 
RNA molecules, to form new RNA molecules, at permissive temperatures (Boorsma 
et al. Nature Biotech. 18:429-432 (2001)). Due to the extremely tight regulation of 
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these systems it is possible to obtain regulated gene expression even under conditions 
of transient transfection allowing the production of toxic proteins under these 
conditions. 

Another example of an alphaviral expression vector is described in 
5 Lundstrom et al, FEES Lett 50^:99-103 (2001). Lxmdstrom describe a rapidly 
generated high-titer Semliki Forest virus (SFV) vectors capable of infecting 
numerous mammalian cell lines and primary cell cultures, resulting in high levels of 
transgene expression, SFV-based expression of transmembrane receptors has been 
characterized by specific ligand-binding activity and functional responses. 

10 Adaptation of the SFV technology for mammalian suspension cultures has allowed 
the production of hundreds of milligrams of recombinant receptor for purification 
and stmctural studies. The same SFV stock solutions used for the infection of 
mammalian cells in culture have also been successfully applied for efficient 
transgene expression in organotypic hippocampal slices, as well as in vivo in rodent 

15 brain. 

A need exists in the art, however, for compositions and methods that allow 
the production of polypeptides and xmtranslated RNA molecules. 



BRIEF SUMMARY OF THE INVENTION 

20 The present invention provides compositions and methods that allow the 

production of polypeptides and/or untranslated RNA molecules. More specifically, 
the invention provides nucleic acid molecules, expression systems, recombinant host 
cells that permit the production of polypeptides and/or untranslated RNA molecules. 
Also provided are methods for making the nucleic acid molecules, expression 

25 systems and host cells of the invention. The invention further provides methods for 
producing polypeptides and/or untranslated RNA molecules. The invention also 
provides kits that comprise the nucleic acid molecules, expression systems and/or 
recombinant host cells of the invention. 

In certain embodiments, the invention provides compositions and methods for 

30 regulated, expression of polypeptides or untranslated RNA molecules in recombinant 
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host cells, preferably by way of transfection. The present invention also provides 
nucleic acid molecules and methods which allow, e.g., rapid and high level 
production of specific RNA molecules produced in, e.g., transiently transfected, 
recombinant host cells. 

5 According to certain aspects of the invention, novel, preferably inducible, 

potent and rapid expression systems are provided comprising one or more nucleic 
acid molecules. The nucleic acid molecules of the expression systems of the 
invention may be maintained at, or accumulated to, multiple copies in the nuclei of 
recombinant host cells, preferably of transiently transfected mammalian cells. In 

10 certain embodiments, RNA self replication may lead to the accumulation of a high 
number of RNA molecules in the cytoplasm of the transfected cells and subsequent 
translation of the polypeptide of interest. The nucleic acid molecules and expression 
systems of the invention are particularly useful for transient transfection. 

The present invention further provides nucleic acid molecules and methods 

15 which allow inducible rapid and high level production of specific RNA molecules 
produced in, preferably transiently transfected, recombinant host cells. In certain 
embodiments, RNA self-replication is preferably inducible and triggered by a 
temperature shift. Thus, the high production levels, in certain embodiments, are 
achieved by the use of a temperature-sensitive RNA-dependent RNA polymerase 

20 (Le., a replicase) which only replicates RNA molecules to form new RNA molecules 
at permissive temperatures. Moreover, if desired, the amount of RNA or polypeptide 
can be tightly regulated by the use of a temperature-sensitive RNA-dependent RNA 
polymerase. This facilitates, in particular, production of growth-inhibitory or toxic 
polypeptides or RNA-species. 

25 In a general aspect, the present invention provides for a nucleic acid molecule 

comprising (A) a first polynucleotide element which encodes an RNA molecule, said 
RNA molecule comprising (a) at least one c/5-acting sequence element; (b) a first 
nucleotide sequence comprising a first open reading frame, said first open reading 
fi-ame having a nucleotide sequence encoding an RNA-dependent RNA polymerase; 

30 and (c) at least one second nucleotide sequence selected from the group consisting of 
(i) a second open reading frame encoding a polypeptide; (ii) a nucleotide sequence 
complementary to all or part of the second open reading frame of (i); and (iii) a 
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nucleotide sequence encoding an untranslated RNA molecule or complement thereof; 
wherein said second nucleotide sequence is operably linked to a promoter which is 
recognized by said RNA-dependent RNA polymerase; (B) a second DNA element 
comprising an origin of replication; and (C) a third DNA element encoding a 
5 replication initiation factor capable of recognizing the origin of replication. Thus, the 
replication initiation factor preferably recognizes the origin of replication leading to 
replication of the second polynucleotide and second DNA element, respectively, 
comprising the origin of replication. 

In certain embodiments, the RNA-dependent RNA polymerase is, e.g., a non- 
10 cytopathic RNA-dependent RNA polymerase. In other embodiments, the RNA- 
dependent RNA polymerase is, e.g., a temperature-sensitive RNA-dependent RNA 
poljonerase. In yet other embodiments, the RNA-dependent RNA polymerase is, 
e.g., a non-cytopathic, temperature-sensitive RNA-dependent RNA polymerase. 

The nucleic acid molecules of the invention, in certain embodiments, 
15 comprise a 5' promoter which is capable of initiating transcription in vivo, 5' and/or 3' 
sequences enabling replication of the RNA molecule (cw-acting sequence elements), 
and a subgenomic promoter 5' to the gene of interest, as well as a sequence of interest 
which is translatable only after one or more RNA-dependent RNA replication events. 
These RNA-dependent RNA replication events are catalyzed by a, preferably 
20 regulatable, RNA-dependent RNA polymerase which may be encoded by the same 
mRNA molecule that is produced by transcription of the DNA vector or by a 
different mRNA molecule. 

The inventive nucleic acid molecules, expression systems and vector systems 
of the present invention thus allow for large-scale transient transfection and very 
25 rapid production of polypeptides eliminating the need of isolating stably transformed 
high producing cell clones. The preferred embodiment and hereby the inducibility of 
the vector makes the invention especially suitable for the production of cytotoxic 
polypeptides, e.g., polypeptides that are detrimental to the viability of a host cell. 

In a preferred embodiment of the present invention, the origin of replication 
30 and the replication initiation factor are derived from a DNA virus. For example, the 
origin of replication may be derived from a herpes virus and the replication initiation 
factor may be the Epstein-Barr virus nuclear antigen 1 (EBNAl). However, 
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replication initiation factors may also derive from other viruses or be cellular factors, 
which are capable of recognizing or recognize viral origins of replication. As 
indicated, in a further embodiment, the RNA-dependent RNA-polymerase is 
temperature-sensitive and temperature-regulated, respectively. Preferably, the RNA- 
5 dependent RNA-polymerase is of viral, more preferably of alphaviral origin. 

Thus, in a preferred embodiment, the present invention provides a hybrid 
vector and vector system combining the advantages of EBV episomal replication 
with the advantages of RNA virus replication, wherein the replication takes place 
entirely in the cytoplasm. This type of hybrid vector would be characterized by the 

10 maintenance of multiple copies of DNA due to polynucleotide replication, typically 
plasmid replication, followed by the accumulation of full-length transcripts 
containing the viral replicon. These viral full-length transcripts are then further 
exponentially amplified in the cytoplasm by the viral replicase leading to the 
accumulation of RNA virus replicons and protein synthesis. The system is extremely 

15 valuable for production of toxic polypeptides if the RNA vims replication is 
controllable or inducible. 

In a specific embodiment, the invention includes the combination of a 
Herpesvirus mini-replicon unit, i.e. the cis-acting replication origin OriP (Origin of 
replication P) and the cis- or trans-acting gene product - EBNA-1 (Epstein-Barr 

20 virus nuclear antigen 1, a replication triggering factor) with the tightly regulated 
temperature inducible alphaviral expression system pCytTS, as disclosed in WO 
99/504332. Due to the presence of OriP and EBNA-1 the introduced novel vector is 
maintained episomally (extrachromosomally) in the form of several DNA copies in 
the cell nuclei. These multiple DNA copies may be transcribed from a promoter, 

25 typically a CMV or RSV promoter, into CytTS RNA-replicons, but they remain 
inactive unless cells are being shifted to a certain temperature. Only after temperature 
induction replicon replication takes place, followed by RNA accumulation and 
translation in the cell cytoplasm and production of the polypeptide of interest. 

In a further general aspect, the present invention provides a vector system 

30 comprising one or more nucleic acid molecules, wherein said one or more nucleic 
acid molecules comprise (A) a first polynucleotide which encodes an RNA molecule, 
said RNA molecule comprising (a) at least one c/5-acting sequence element; (b) a 
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first nucleotide sequence comprising a first open reading frame having a nucleotide 
sequence encoding an RNA-dependent RNA polymerase; and (c) at least one second 
nucleotide sequence selected from the group consisting of (i) a second open reading 
frame encoding a polypeptide; (ii) a sequence complementary to all or part of the 
5 second open reading frame of (i); and (iii) a sequence encoding an untranslated 
RNA molecule, or complement thereof; wherein said second nucleotide sequence is 
operably linked to a promoter which is recognized by said non-cytopathic RNA- 
dependent RNA polymerase; (B) a second polynucleotide comprising an origin of 
replication; and (C) a third polynucleotide encoding a replication initiation factor 
10 capable of recognizing the origin of replication. Thus, the replication initiation factor 
recognizes the origin of replication leading to replication of the second 
polynucleotide and second DNA element, respectively, comprising the origin of 
replication. 

The invention fiuther provides single- and multiple-vector systems for 

15 producing a polypeptide or untranslated RNA molecule. In a single-vector system, 
the first, second and third polynucleotide are present on the same nucleic acid 
molecule. In a multiple-vector system, the first, second and third polynucleotide 
elements are present on one or more separate nucleic acid molecules. In such a 
multiple vector system, at least one, preferably some or more preferably all vectors 

20 contain an origin of replication. Therefore, in preferred embodiments of the 
invention, the replication initiation factor and the origin of replication recognizing 
the replication initiation factor may be on the same nucleic acid molecule and 
plasmid, respectively, as it is the RNA-replicase or on a separate nucleic acid 
molecule and plasmid, respectively. 

25 When sequences encoding the first and second open reading frame are 

present either on the same nucleic acid molecule or in the same vector (i.e., in a 
single- vector system), a region will preferably be present 5' to the second open 
reading frame which inhibits translation of this open reading frame. 

If a temperature sensitive RNA-dependent RNA polymerase (e.g., a 

30 temperature-sensitive replicase) is included within the compositions and methods of 
the invention, the temperature-sensitive replicase may be "cold" or "hot" sensitive 
and thus will only efficiently catalyze RNA-dependent RNA replication at 
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temperatures either above or below the restrictive temperature. In one preferred 
embodiment, the compositions and methods of the invention comprise, or involve the 
use of, an RNA-dependent RNA polymerase that has replicase activity at 
temperatures below 34°C which is at least five fold greater than the replicase activity 
5 exhibited at 34°C or above. Preferably, the replicase activity at temperatures below 
34°C is at least ten fold greater than the replicase activity exhibited at 34*^C or above. 
In certain embodiments, the replicase activity at temperatures below 34°C is at least 
one hxmdred fold greater than the replicase activity exhibited at 34''C or above. 
Temperature-sensitive RNA-dependent RNA polymerases included within the 

10 present invention may, in certain embodiments, exhibit no (or imdetectable) replicase 
activity at 34°C or above. 

In another aspect, the nucleic acid molecules and expression systems of the 
invention encode one or more cytokine, lymphokine, tumor necrosis factor, 
interferon, toxic protein, prodrug converting enzyme, or other polypeptide. 

15 In yet another aspect, the nucleic acid molecules and expression systems of 

the invention encode an untranslated RNA molecule, such as an antisense RNA 
molecule, tRNA molecule, rRNA molecule, or ribozyme. 

The invention also provides methods for making recombinant host cells 
comprising introducing nucleic acid molecules of the invention into host cells. 

20 Further provided are recombinant host cells produced by the introduction of nucleic 
acid molecules of the invention. In one embodiment, one, some or all of these 
recombinant host cells contain one or more nucleic acid molecules that comprise an 
RNA-dependent RNA polymerase and/or the replication initiation factor. The 
replication initiation factor, in certain embodiments, may be stably integrated into the 

25 genome of the one, some or all host cells. 

The invention further provides isolated nucleic acid molecules comprising 
polynucleotides which comprise the nucleotide sequence of SEQ ID NO:21 (pCytTs- 
OriP) and SEQ ID NO:22 (pCytTs-OPE). 

The present invention also provides methods for producing polypeptides and 

30 untranslated RNA molecules in recombinant host cells, said methods comprising 
introducing a nucleic acid molecule of the invention into a host cell to produce a 
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recombinant host cell, and culturing the recombinant host cell under conditions 
suitable for expression of said polypeptide or untranslated RNA molecule. The 
methods of the invention may further comprise recovering said polypeptide or 
untranslated RNA molecule. 
5 Methods are also provided for the expression of heterologous polypeptides, 

preferably the regulated expression of heterologous polypeptides, including 
cytokines, lymphokines, tumor necrosis factors, interferons, toxic polypeptides, and 
prodrug converting enzymes. 

Further provided are polypeptides and untranslated RNA molecules produced 

10 by the methods of the invention. 

The invention also provides methods for regulating the expression of a 
polypeptide or an untranslated RNA molecule, said method comprising introducing 
nucleic acid molecules of the invention into a host cell to produce a recombinant host 
cell and growing said recombinant host cell under suitable culture conditions. In 

15 certain embodiments, nucleic acid molecules encoding a temperature-sensitive RNA- 
dependent RNA-polymerase are used. When a temperature-sensitive RNA-dependent 
RNA-polymerase is used, the methods according to this aspect of the invention may 
further comprise changing the temperature of the recombinant host cell culture from 
either (i) a permissive temperature to a restrictive temperature, or (ii) a restrictive 

20 temperature to a permissive temperature. 

In certain embodiments, the methods of the invention involve introducing the 
nucleic acid molecules of the invention into prokaryotic or eukaryotic host cells to 
produce a recombinant host cell, and then culturing said recombinant host cell imder 
suitable culture conditions. The recombinant host cells may be cultured, e.g., in a 

25 serum-free or protein-free medium. 

The present invention also provides pharmaceutical compositions comprising 
nucleic acid molecules of the invention and a pharmaceutically acceptable carrier. 

Also included within the invention are kits comprising the nucleic acid 
molecules of the invention. Kits of the invention may additionally or alternatively 

30 comprise one or more expression systems of the invention and one or more 
recombinant host cells of the invention. 
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BRIEF DESCRIPTION OF THE DRAWING 

Fig. 1 is a schematic representation of vector pCytTs-EGFP-OPE described in 
Example 2. The EGFP expression cassette can be replaced by any gene of interest 
(e.g. SEAP, EPO, IFN). CMV: Cytomegalovirus promoter, pxiro: pxiromycin 
5 resistance marker; amp: ampicillin resistance marker; ColEl: bacterial origin of 
replication; SG prom: subgenomic promoter; ORIP, EBV origin of replication; 
EBNAl, EBV nuclear antigen 1; GFP: green fluorescence protein; nspl-4: genes 
coding for the viral non-structural proteins containing the nsp 4 (Pro728Ser) 
mutation and the nsp2 (Glyl53Glu) mutation that renders the replicase temperature 
10 sensitive and non-cj^opathic. 



DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed to compositions and methods that are useful, 
e.g., in the production of polypeptides and/or untranslated RNA molecules in host 

15 cells. The compositions and methods of the invention are also useful, e.g., for the 
isolation and/or purification of polypeptides, polypeptides and untranslated RNA 
molecules that are produced in host cells. 

According to one aspect of the invention, nucleic acid molecules are 
provided. The nucleic acid molecules of the invention comprise: (A) a first 

20 polynucleotide element which encodes an RNA molecule, said RNA molecule 
comprising: (a) at least one c/^-acting sequence element; (b) a first nucleotide 
sequence comprising a first open reading frame, said first open reading frame having 
a nucleotide sequence encoding an RNA-dependent RNA poljnnerase; and (c) at 
least one second nucleotide sequence selected from the group consisting of: (i) a 

25 second open reading frame encoding a polypeptide; (ii) a nucleotide sequence 
complementary to all or a part of the second open reading fi-ame of (i); and (iii) a 
nucleotide sequence encoding an untranslated RNA molecule or complement thereof; 
wherein said second nucleotide sequence is operably linked to a promoter which is 
recognized by said RNA-dependent RNA polymerase; (B) a second polynucleotide 

30 element comprising an origin of replication; and (C) a third polynucleotide element 
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encoding a replication initiation factor capable of recognizing said origin of 
replication. 

According to another aspect of the invention, expression systems are 
provided. The expression systems of the invention comprise one or more nucleic 
5 acid molecules, wherein said one or more nucleic acid molecules comprise: (A) a 
first polynucleotide element which encodes an RNA molecule, said RNA molecule 
comprising: (a) at least one c/^-acting sequence element; (b) a first nucleotide 
sequence comprising a first open reading frame, said first open reading frame having 
a nucleotide sequence encoding an RNA-dependent RNA polymerase; and (c) at 

10 least one second nucleotide sequence selected from the group consisting of: (i) a 
second open reading fi-ame encoding a polypeptide; (ii) a nucleotide sequence 
complementary to all or a part of the open reading frame of (i); and (iii) a nucleotide 
sequence encoding an untranslated RNA molecule or complement thereof; wherein 
said second nucleotide sequence is operably linked to a promoter which is 

15 recognized by said RNA-dependent RNA polymerase; (B) a second polynucleotide 
element comprising an origin of replication; and (C) a third polynucleotide element 
encoding a replication initiation factor capable of recognizing said origin of 
replication. 

In the expression systems of the invention, said first, second and third 
20 polynucleotide elements may each be on a separate nucleic acid molecule. 
Alternatively, said first, second and third polynucleotide elements may each be on a 
single nucleic acid molecule. In certain embodiments, said first and second 
polynucleotide elements will be on a single nucleic acid molecule while said third 
polynucleotide element is on a different nucleic acid molecule. In certain other 
25 embodiments, said first and third poljmucleotide elements will be on a single nucleic 
acid molecule, while said second polynucleotide element is on a different nucleic 
acid molecule. In certain other embodiments, said second and third polynucleotide 
elements will be on a single nucleic acid molecule, while said first poljaiucleotide 
element is on a different nucleic acid molecule. 
30 According to another aspect of the invention, recombinant host cells and in 

vitro cell cultures comprising recombinant host cells are provided. The recombinant 
host cells of the invention comprise: (A) a fu-st polynucleotide element which 
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encodes an RNA molecule, said RNA molecule comprising: (a) at least one cis- 
acting sequence element; (b) a first nucleotide sequence comprising a first open 
reading frame, said first open reading frame having a nucleotide sequence encoding 
an RNA-dependent RNA polymerase; and (c) at least one second nucleotide 
5 sequence selected from the group consisting of: (i) a second open reading frame 
encoding a polypeptide; (ii) a nucleotide sequence complementary to all or a part of 
the second open reading frame of (i); and (iii) a nucleotide sequence encoding an 
untranslated RNA molecule or complement thereof; wherein said second nucleotide 
sequence is operably linked to a promoter which is recognized by said RNA- 

10 dependent RNA polymerase; (B) a second polynucleotide element comprising an 
origin of replication; and (C) a third polynucleotide element encoding a replication 
initiation factor capable of recognizing said origin of replication. 

According to another aspect of the invention, methods are provided for 
producing, or regulating the expression of, a polypeptide or imtranslated RNA 

15 molecule. The methods according to this aspect of the invention comprise: (a) 
introducing a nucleic acid molecule or an expression system of the invention into a 
host cell to produce a recombinant host cell; and (b) culturing said recombinant host 
cell. The methods of the invention may fiirther comprise recovering said polypeptide 
or untranslated RNA molecule. 

20 According to another aspect of the invention, kits are provided comprising 

the nucleic acid molecules, expression systems and/or host cells of the invention. 

In certain embodiments, said second open reading frame that is included 
within the compositions and methods of the invention is in a translatable format after 
one or more RNA-dependent RNA replication events. The expression "a translatable 

25 format" is intended to mean an RNA from which a protein can be made. 

The RNA-dependent RNA polymerase that is included in, or used with, the 
compositions and methods of the invention may, in certain embodiments, be selected 
from the group consisting of: (a) a temperature-sensitive RNA-dependent RNA 
polymerase, (b) a non-cytopathic RNA-dependent RNA polymerase, and (c) a 

30 temperature-sensitive, non-cytopathic RNA-dependent RNA polymerase. 

When the RNA-dependent RNA polymerase that is included in, or used with, 
the compositions and methods of the invention is temperature sensitive, the 
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temperature-sensitive RNA-dependent RNA polymerase may have replicase activity 
at temperatures below 34°C which is at least five fold, at least ten fold, at least 
twenty fold, at least thirty fold, at least forty fold, at least fifty fold, at least one 
hundred fold, or at least one thousand fold greater than the replicase activity 
5 exhibited at 34°C or above. For example, the temperature-sensitive RNA-dependent 
RNA polymerase may have replicase activity at 34°C that is at least five fold greater 
than the replicase activity exhibited at 29°C. The temperature-sensitive RNA- 
dependent RNA polymerase may, in certain embodiments, exhibit no or undetectable 
replicase activity (using methods of detection known in the art) at 34°C or above. 

10 The RNA-dependent RNA polymerase that is included in, or used with, the 

compositions and methods of the invention may, in certain embodiments, be of viral 
origin. For example, the RNA-dependent RNA poljonerase may be of alphaviral 
origin. In specific embodiments of the invention, the RNA-dependent RNA 
polymerase is derived from a Sindbis virus, a Semliki Forest virus or an Aura virus. 

15 The RNA-dependent RNA polymerase may be derived from a virus selected from 
the group consisting of Bebaru virus, Cabassou virus, Chikimgunya virus, Easter 
equine encephalomyelitis virus. Fort Morgan virus, Getah virus, Kyzylagach virus, 
Mayoaro virus, Middleburg virus, Mucambo virus, Ndumu virus, Pixuna virus, 
Tonate virus, Triniti virus, Una vims. Western equine encephalomyelitis virus, 

20 Whataroa virus, Venezuelan equine encephalomyelitis virus (VEE), and Ross River 
vims. 

The origin of replication that is included in, or used with, the nucleic acid 
molecules, the expressions systems, compositions and methods of the invention may 
be derived from a prokaryotic organism, a eukaryotic organism (e.g., a yeast, insect 

25 or mammal), or a vims. For example, the origin of replication may be derived from a 
DNA vims, e.g., a DNA vims that allows for episomal replication. The origin of 
replication may be derived from a DNA vims selected from the group consisting of 
Herpesvims, Epstein-Barr vims (EBV), Papillomavims, Polyomavims, Adenovims, 
and Hepadnavims. In a specific embodiment, the origin of replication is or/P, 

30 derived from EBV. 

The replication initiation factor that is included in, or used with, the 
compositions and methods of the invention is, in certain embodiments, capable of 
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operating as a plasmid maintenance factor. As used herein, the term "plasmid 
maintenance factor" is intended to mean a factor, which supports the distribution of 
the episomes to the daughter cells upon cell division. In the case of EBNA-1, this is 
achieved by the binding of the protein to the episome and, at the same time, to the 
5 chromosome, thereby leading to proper distribution of the episomes to the daughter 
cells. The replication initiation factor may be derived from a prokaryotic organism, a 
eukaryotic organism (e.g.^ a yeast, insect or mammal), or a virus. For example, the 
replication initiation factor may be derived from a DNA virus, e.g., a DNA virus that 
allows for episomal replication. The replication initiation factor may be derived 
10 from a DNA virus selected from the group consisting of Herpesvirus, Epstein-Barr 
virus (EBV), Papillomavirus, Polyomavirus, Adenovirus, and Hepadnavirus, In a 
specific embodiment, the replication initiation factor is the EBNA-1 protein, derived 
from EBV. 

The origin of replication and the replication initiation factor that are included 

15 in, or used with, the compositions and methods of the invention may, in certain 
embodiments, be derived from the same organism or the same virus. Altematively, 
the origin of replication and the replication initiation factor may be derived from 
different organisms or viruses. 

The nucleic acid molecules and expression systems of the invention may 

20 further comprise a fourth polynucleotide element, wherein said fourth polynucleotide 
element comprises a selection marker. Any selection marker known in the art may 
be used, including, e.g. , markers that confer resistance to antibiotics. An exemplary 
selection marker is one that confers resistance to puromycin. Altematively, selection 
markers may be used that confer resistance to hygromycin, gpt, neomycin, zeocin, 

25 ouabain, blasticidin, or bleomycin, or markers such as DHRF, hisD, trpB, or 
glutamine synthetase. 

The second open reading frame that is included in, or used with, the 
compositions and methods of the invention may encode, in certain embodiments, any 
polypeptide of interest. For example, the second open reading frame may encode, 

30 e.g., Si cytokine, a lymphokine, a tumor necrosis factor, an interferon {e.g., beta- 
interferon, including human beta-interferon), a toxic polypeptide, or a prodrug. 
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The second open reading frame that is included in, or used with, the 
compositions and methods of the invention may encode, in certain embodiments, an 
untranslated RNA molecule. Exemplary imtranslated RNA molecules include, e.g,^ 
an antisense RNA molecule, a tRNA molecule, an rRNA molecule and a ribozjone. 
5 Other useful untranslated RNA molecules are known in the art. 

Thus, in cert£iin exemplary embodiments, the invention provides 
compositions and methods which allow for the production of high amounts of 
specific RNA molecules produced in host cells upon their transient transfection with 
said polynucleotides. This high production results, e.g., from the use of a self- 
10 replicating episomal DNA plasmid combined with an RNA-dependent RNA 
polymerase, which amplifies the mRNA of interest. 

Expression, using the compositions and methods of the invention, may be 
regulated precisely, if necessary. Such precise regulation can be achieved, e.g^ from 
the use of a temperature-sensitive RNA-dependent RNA polymerase which only 
15 replicates RNA molecules to form additional RNA molecules at permissive 
temperatures. 

According to the present invention, polypeptides and/or imtranslated RNA 
molecules may be produced in transient transfection experiments due to the use of 
nucleic acid molecules comprising an origin of replication and DNA encoding a 

20 protein (a factor or a DNA-dependent replicase) that can recognize the origin of 
replication, thereby resulting in replication of the plasmid. 

The expression systems of the invention, in certain embodiments, comprise 
alphavirus DNA vectors that can be used to create transiently transfected cell lines. 
Said alphavirus DNA vectors may carry genes encoding a non-cytopathic replicase 

25 and viral non-structural proteins, thereby being able to produce high amoxmts of 
polypeptide. An induction of expression is achieved when the activity of the 
temperature-sensitive replicase is switched on by reducing the incubation 
temperature of the transfected cells from 37^C to a temperature lower than 34°C. 
Protein expression in the host at 37^C is, preferably, below the level of detection. 



30 
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DEFINITIONS 

The following definitions are provided to clarify the subject matter, which the 
inventors consider to be the present invention. 
5 As used herein, a nucleic acid molecule is a sequence of contiguous 

nucleotides (riboNTPs, dNTPs or ddNTPs, or combinations thereof) of any length 
which may encode a full-length polypeptide or a fragment of any length thereof, or 
which may be non-coding. As used herein, the terms "nucleic acid molecule," and 
"polynucleotide," "polynucleotide construct," and "polynucleotide element" may be 

10 used interchangeably. 

As used herein, the term "alphavirus" refers to any of the RNA viruses 
included within the genus Alphavirus. Descriptions of the members of this genus are 
contained in Strauss and Strauss, Microbiol. Rev., 55:491-562 (1994). Examples of 
alphaviruses are selected from the group comprising Aura virus, Bebaru virus, 

15 Cabassou virus, Chikimgunya virus, Easter equine encephalomyelitis virus. Fort 
morgan virus, Getah virus, Kyzylagach virus, Mayoaro virus, Middleburg vims, 
Mucambo virus, Ndumu vims, Pixuna virus, Tonate vims, Triniti virus, Una virus. 
Western equine encephalomyelitis vims, Whataroa vims, Sindbis vims (SIN), 
Semliki forest vims (SFV), Venezuelan equine encephalomyelitis vims (VEE), and 

20 Ross River vims. 

As used herein, the term "purified" used in reference to a molecule means that 
the concentration of the molecule being purified has been increased relative to 
molecules associated with it in its natural environment. Naturally associated 
molecules include polypeptides, nucleic acids, lipids and sugars but generally do not 

25 include water, buffers, and reagents added to maintain the integrity or to facilitate the 
purification of the molecule. For example, even if mRNA is diluted with an aqueous 
solvent during oligo dT column chromatography, mRNA molecules are purified by 
this chromatography if naturally associated nucleic acids and other biological 
molecules do not bind to the column and are separated from the subject mRNA 

30 molecules. 

As used herein, the term "isolated" used in reference to a molecule means that 
the molecule has been removed from its native environment. For example, a 
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polynucleotide or a polypeptide naturally present in a living animal is not "isolated," 
but the same polynucleotide or polypeptide separated from the coexisting materials 
of its natural state is "isolated." Further, recombinant DNA molecules contained in a 
vector are considered isolated for the purposes of the present invention. Isolated 
5 RNA molecules include in vivo or in vitro RNA replication products of DNA and 
RNA molecules. Isolated nucleic acid molecules further include synthetically 
produced molecules. Additionally, vector molecules contained in recombinant host 
cells are also isolated. Thus, not all "isolated" molecules need to be "purified." 

As used herein, the phrase "low or undetectable," when used in reference to 
10 gene expression level, refers to a level of expression which is either significantly 
lower than that seen when the gene is maximally induced {e.g., at least five fold 
lower) or is not readily detectable by the methods used in the following examples 
section. 

As used herein, the phrase "individual" refers to multicellular organisms and 

15 includes both plants and animals. Preferred multicellular organisms are animals, 
more preferred are vertebrates, even more preferred are mammals, and most 
preferred are humans. 

As used herein, the term "pCytTs" means any temperature sensitive, non- 
cytopathic inducible alphaviral expression system containing a functional promoter 

20 to drive the transcription of mRNA from the nucleic acid molecule of the invention. 
Examples for such expression systems or vectors may be, e.g., the vector pCytTs 
(SEQ ID NO: 1) containing a RSV promoter or the vector pCytTs2.1 (SEQ ID NO: 
2) containing a CMV promoter. When combinatorial constructs with pCytTs are 
mentioned in the Examples, they generally contain a CMV promoter, even if the 

25 abbreviation of the combinatorial constructs does — for the sake of simplicity - only 
refer to pCytTs and not to pCytTs2. 1 . 

As used herein, the phrase "cw-acting" sequence refers to nucleic acid 
sequences to which a replicase binds to catalyze the RNA-dependent replication of 
RNA molecules. These replication events result in the replication of the full-length 

30 and partial RNA molecules and, thus, the alphavirus subgenomic promoter is also a 
"c/5-acting" sequence. C/5-acting sequences may be located at or near the 5' end, 3' 
end, or both ends of a nucleic acid molecule, as well as intemally. 
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As used herein, the phrase "RNA-dependent RNA polymerase" refers to a 
polymerase which catalyzes the production of an RNA molecule from another RNA 
molecule. This term is used herein sjoionymously with the term "replicase." 

As used herein, the phrase "non-infective packaged RNA molecules" refers to 
5 packaged RNA molecules which will essentially undergo only one round of host cell 
infection and are not pathogenic. These molecules are thus "infective" but only for a 
single infectious entry into a host cell and are not capable of reproducing to form 
additional infectious particles. 

As used herein, the term "transcription" refers to the production of RNA 
10 molecules from DNA templates catalyzed by RNA polymerases. 

As used herein, the phrase "RNA-dependent RNA replication event" refers to 
processes which result in the formation of an RNA molecule using an RNA molecule 
as a template. 

As used herein, the term "vector" refers to an a polynucleotide construct, 
15 typically a plasmid or a virus, used to transmit genetic material to a host cell. 
Preferably, the term "vector" as used herein refers to an agent such as a plasmid, and 
even more preferably to a circular plasmid, A vector as used herein may be 
composed of either DNA or RNA. Preferably, a vector as used herein is composed of 
DNA. 

20 As used herein, the term "heterologous sequence" refers to a second 

nucleotide sequence present in a vector of the invention. The term "heterologous 
sequence" also refers to any amino acid or RNA sequence encoded by a heterologous 
DNA sequence contained in a vector of the invention. Heterologous nucleotide 
sequences can encode polypeptides or RNA molecules normally expressed in the cell 

25 type in which they are present or molecules not normally expressed therein (e.g., 
Sindbis structural proteins). 

As used herein, the phrase "imtranslated RNA" refers to an RNA sequence or 
molecule which does not contain an open reading frame or encodes an open reading 
frame, or portion thereof, but in a format in which an amino acid sequence will not 

30 be produced (e.g,, no initiation codon is present). Examples of such molecules are 
tRNA molecules, rRNA molecules, and ribozymes. Antisense RNA may be 
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untranslated but, in some instances antisense sequences can be converted to a 
translatable sense strand from which a polypeptide is produced. 

As used herein, the phrase "temperature-sensitive" refers to an enzyme which 
readily catalyzes a reaction at one temperature but catalyzes the same reaction slowly 
5 or not at all at another temperature. An example of a temperature-sensitive enzyme 
is the replicase protein encoded by the pCytTs vector, which has readily detectable 
replicase activity at temperatures below and has low or undetectable activity at 
37°C. 

As used herein, the phrase "permissive temperature" refers to temperatures at 

10 which an enzyme has relatively high levels of catalytic activity. 

As used herein, the phrase "restrictive temperature" refers to temperatures at 
which an enzyme has low or undetectable levels of catalytic activity. Both "hot" and 
"cold" sensitive mutants are known and, thus, a restrictive temperature may be higher 
or lower than a permissive temperature. 

15 As used herein, the term "recombinant host cell" refers to a host cell into 

which one ore more nucleic acid molecules of the invention have been introduced. 

As used herein, the term "replication initiation factor" refers to a protein or a 
DNA sequence encoding such a protein that is able to recognize an origin of 
replication leading to replication of plasmids containing this origin of replication. 

20 As used herein, the term "origin of replication" refers to a DNA sequence 

that is recognized by a replication initiation factor or a DNA replicase leading to 
replication of a plasmid containing the origin of replication. The expression 
"recognized by a replication initiation factor" is intended to mean that a replication 
initiation factor can physically interact with all or a portion of an origin of replication 

25 sequence, thereby causing or stimulating molecular mechanisms that ultimately 
cause all or a portion of the DNA molecule comprising the origin of replication to be 
replicated. 

As used herein the term "inducible" means that in an inactive state, the basal 
activity of a system should be minimal, while in the active state high levels of gene 
30 expression should be rapidly inducible. 

When the terms "one," "a," or "an" are used in this disclosure, they mean "at 
least one" or "one or more," unless otherwise indicated. 
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NUCLEIC ACID MOLECULES AND EXPRESSION SYSTEMS 

Certain preferred embodiments of the present invention are based on the 
vectors used in the previously described pCytTs expression system (WO 99/50432, 
5 the disclosure of which is hereby incorporated by reference in its entirety). These 
vectors are constitutively transcribed in host cells to produce mRNA molecules 
having two open reading frames. These open reading frames, which may or may not 
be produced from the same nucleic acid molecule, encode a temperature-sensitive 
replicase and a heterologous gene of interest. The first open reading frame is 

10 translated to produce an RNA-dependent RNA polymerase. The second open 
reading frame, encoding all or part of one or more polypeptides of interest, is not 
translated until after at least one RNA-dependent RNA replication event. 

The nucleic acid molecule of the present invention comprise, inter alia, at 
least one second nucleotide sequence. The second nucleotide sequence is preferably 

15 operably linked a promoter which is recognized by an RNA-dependent RNA 
polymerase. The second nucleotide sequence, in certain embodiments, may comprise 
a second open reading frame, or other nucleotide sequence of interest, as well as 
other elements. For instance, the second nucleotide sequence may comprise a 5' 
promoter which is capable of initiating synthesis of RNA in v/vo, 5' and/or 3' 

20 sequences enabling replication of the RNA molecule (5* and 3' cis acting sequence 
elements), as well as a sequence of interest which is translatable only after at least 
one replication event. Replication is catalyzed by an RNA-dependent RNA 
polymerase which is encoded altematively on the same or on a different mRNA 
molecule. The sequence of interest may be encoded in sense, plus (+), orientation 

25 downstream of a viral RNA promoter. Translation of the coding sequence of the 
gene of interest is inhibited by a 5' sequence which, in the case of the single-vector 
system, will generally be the replicase sequence. In the mxiltiple-vector system, a 5' 
sequence can inhibit translation by having one or more short open reading frames 
with associated stop codons which lead to the detachment of ribosomes. Similarly, 

30 any sequence which inhibits the traveling or binding of ribosomes to the sequence of 
interest can be used as a 5* sequence which inhibits translation (Voet and Voet, 
Biochemistry, John Wiley & Sons, Inc. (1990)). 
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Another method for preventing translation of nucleotide sequences in most 
biological systems involves the insertion of the sequence in an antisense direction. 
This method of inhibiting translation is based on the principle that translation will 
generally only occur after the replication of this minus (-) strand RNA into a plus 
5 strand having an open reading frame in a sense orientation. The translated sense 
strand is formed by RNA replication and serves as a template for ribosomes and 
protein synthesis. As shown previously (WO 99/50432) production of amino acid 
sequences can occur even when the gene of interest is inserted into the DNA 
molecule in an orientation which will result in the formation of antisense RNA 

10 sequence 3' to the subgenomic promoter. Thus, the second open reading frame may 
also comprise a sequence complementary to all or part of the second open reading 
frame described above and expression of the encoded amino acid sequence will still 
occur. When the production of an untranslated antisense RNA sequence is desired, 
the RNA molecule can be designed so that it will not serve as a template for protein 

1 5 synthesis. For example, the RNA can be designed so that an initiation codon is not 
present. 

The second nucleotide sequence may altematively or additionally comprise a 
nucleotide sequence encoding an untranslated RNA molecule or complement thereof. 
For example, RNA molecules directly produced by transcription of a DNA sequence 

20 of the invention may encode RNA sequences which are neither translated nor present 
in an antisense orientation. Examples of such untranslated RNA molecules include 
tRNA molecules, rRNA molecules, and ribozymes. A considerable number of 
ribozyme sequences with defined catal54ic activities are known in the art {see, e.g.. 
Brown, J., Nucleic Acids Res, 25:351-352 (1998); Xie, Y. et al, Proc, Natl Acad, 

25 ScL USA P^:13777-13781 (1997); Lavrovsky. Y et ai, Biochem, Mol Med, 62:1 1-22 
(1997); Chapman, K. and Szostak, J., Chem, Biol, 2:325-333 (1995)). Further, 
ribozymes have been used to "knockout" the expression of a specific gene in 
eukaryotic cells as part of a ribozyme-mediated, message deletion strategy (Xie, Y. et 
al, Proc, Natl Acad, ScL USA 94:13777-13781 (1997)). Additionally, alphaviral 

30 replicons have been used to express a functional ribozyme in mammalian cells 
(Smith S. et al, J, Virol 77:9713-9721 (1997)), The regulated expression of such 
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ribozymes, and other untranslated RNA molecules, is thus within the scope of the 
present invention. 

The nucleic acid molecules and expression systems of the invention also 
comprise a second polynucleotide element comprising an origin of replication. 
5 Origins of replication that can be used with the present invention include, e.g.^ those 
that are derived from prokaryotic organisms, eukaryotic organisms {e.g.^ yeasts, 
insects, or mammals) and/or viruses. For example, an origin of replication that may 
be used with the present invention is one that is derived from a DNA virus such as, 
e.g., a Herpesvirus, a Papillomavirus, an Adenovirus, or a Hepadnavirus. In certain 

10 embodiments, the origin of replication that is used with the invention is a DNA virus 
that allows for episomal replication, Li a preferred embodiment, the origin of 
replication is derived from Epstein-Barr virus (EBV), such as, e.g., or/P. 

The nucleic acid molecules and expression systems of the invention also 
comprise a third polynucleotide element encoding a replication initiation factor. The 

15 replication initiation factor will be capable of recognizing the origin of replication 
that is included within the nucleic acid molecule or expression system. The third 
polynucleotide element of the invention may encode, e.g., a replication initiation 
factor derived from prokaryotic organism, eukaryotic organism {e.g., yeasts, insects, 
and/or mammals) and/or viruses. For example, a replication initiation factor that 

20 may be used with the present invention is one that is derived from a DNA virus such 
as, e.g., a Herpesvirus, a Papillomavirus, an Adenovirus, or a Hepadnavirus, In 
certain embodiments, the replication initiation factor that is used with the invention is 
capable of operating as a plasmid maintenance factor. In a preferred embodiment, 
the replication initiation factor is derived from Epstein-Barr virus (EBV), such as, 

25 e.g, , the EBNA-1 protein, or a portion thereof. 

The replication initiation factor {e.g., a nucleotide sequence that encodes a 
replication initiation factor) and the origin of replication that are included within the 
nucleic acid molecules and expression systems of the present invention may be 
derived from the same organism or from the same virus. Altematively, the 

30 replication initiation factor and the origin of replication may be derived from 
different organisms or from different viruses. 
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In an exemplary embodiment of the invention, nucleic acid molecules are 
provided, which are constructed by introducing defined Epstein-Barr virus (EBV) 
sequences into vectors such as, e.g., those included within the pCytTs system. The 
resulting exemplary nucleic acid molecules of the invention are capable of 
5 replicating as non-integrated autonomous episomal molecules in the transformed host 
cells. 

The defined sequences, which may be introduced into the vectors of the 
pCj^Ts system, comprise, e.g., either one or two elements of Epstein-Barr virus, 
(OriP alone or together with EBNA-1 gene) which permit plasmid maintenance. The 

10 origin of replication OriP is a cis-acting sequence and needs to be inserted into the 
plasmid vector sequence. Plasmids containing this origin of replication are able to be 
maintained in cells expressing the replication initiation factor which recognizes the 
origin of replication; one of these factors is EBNA-1. EBNA-1 gene function can be 
provided in cis by introducing the sequence into the same plasmid or can be provided 

15 in trans by co-transfection with a second replicating or non-replicating plasmid as 
well as by providing it firom stably transduced cell lines expressing EBNA-1 gene 
from an integrated copy. Furthermore, in certain embodiments, more than one copy 
of the sequence expressing the replication initiation factor, e.g. EBNA-1, is provided. 
For example, EBNA-1 gene function may be provided on one or more plasmids, 

20 wherein none, some or all of those plasmids may additionally comprise the origin of 
replication, and which plasmids are used for transfecting cell lines, preferably cell 
lines that have the sequence of EBNA-1 stably integrated within its genome. The 
presence of both OriP and EBNAl sequences in the same plasmid renders replication 
less dependent on the host cell type. (Reisman, D. et al^ MoL Cell Biol, 5: 1822- 

25 1832 (1985); Yates, J.L. et al. Nature 575:812-815 (1985); U.S. Patent No. 
4,686,186). U.S. Patent No. 4,686,186 describes the transfection of cells with a 
single plasmid containing the EBV OriP, the EBNA-1 gene and a gene encoding a 
protein of interest. 

Although such cell lines have the advantage of stable long-term expression of 
30 the replication initiation factor and durable support of replication and maintenance of 
OriP containing plasmids, there are not many EBNA-1 expressing cell lines 
commercially available (ATCC : 293HEK-EBNA1 and CVl-EBNAl). Altematively, 
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plasmids that already carry the EBNA-1 gene and the gene of interest in cis on the 
same episome are used to transfect cells and commercial vectors such as pCEP4 
(Invitrogen) are available. However, current vectors designed for constitutive 
expression of polypeptides carrying OriP and EBNA-1 on the episomal construct in 
5 cis and wherein the expression is not regulatable or inducible, are not applicable in 
cases where cell-toxic polypeptides need to be expressed. 

To our knowledge, an inducible expression system having the features of a 
self-replicating DNA episome and that allows the expression of a polypeptide or 
untranslated RNA sequence of interest from a self-replicating RNA molecule, 

10 triggered by a simple temperature shift, has not been described. 

Certain embodiments of the invention are directed to nucleic acid molecules 
which are transcribed to produce a mRNA molecule having two open reading frames, 
e,g.^ one open reading frame which encodes a replicase, and another open reading 
frame which encodes a nucleotide sequence of interest. The nucleic acid molecules 

15 of the invention may contain a promoter sequence which drives transcription to 
produce mRNA molecules having coding sequences of both open reading frames. 
The mRNA sequences of the first open reading frame are preferably translated to 
produce a replicase required for the expression of the RNA sequences of the second 
open reading frame. The second open reading frame preferably encodes one or more 

20 polypeptides of interest. In addition, the vectors contain an origin of replication and a 
DNA polymerase recognizing the origin of replication. 

Further, once the first mRNA molecule has been transcribed from the DNA 
vector, additional RNA-dependent RNA replication events may occur to amplify the 
first mRNA sequence and to produce RNA molecules with strand polarity which is 

25 the opposite of the first mRNA sequence. 

The second open reading frame of the nucleic acid molecules of the invention 
will preferably only be expressed after partial replication of a fiill-length RNA 
molecule. This partial replication of the fiill-length RNA molecules is driven by a 
promoter sequence composed of RNA {e,g,^ an alphaviral subgenomic promoter 

30 sequence). In addition, the plasmid encoding the replicase and the gene of interest 
will preferably replicate during division of the host cell due to the presence of origin 
of replication and a protein factor recognizing the origin of replication. 
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While the gene of interest may be encoded by the same RNA molecule as the 
RNA-dependent RNA replicase protem, this gene may also be encoded by a separate 
RNA molecule. Thus, the invention further provides both single- and 
multiple-vectors systems for expressing a gene of interest. Moreover, while all 
5 plasmids used preferably contain an origin of replication, the gene encoding the 
replication initiation factor may be on a separate plasmid or stably integrated into the 
genome of the host cell. 

In a single-vector system of the invention, sequences encoding the first open 
reading frame, the second nucleotide sequence are components of the same nucleic 

10 acid molecule, the origin of replication and the initiation factor. Thus, all of the 
components required for regulated expression of the gene of interest and plasmid 
amplification are contained in a single nucleic acid molecule (/.e., DNA or RNA). 

In a multiple-vector system of the invention, sequences encoding the first 
open reading frame, or sub-portions thereof, and the second nucleotide sequence are 

15 components of different nucleic acid molecules. These multiple-vector systems thus 
may comprise two or more nucleic acid molecules. For example, the replicase and 
the gene of interest can each be encoded by different nucleic acid molecules. 
However, all vectors may contain an origin of replication. 

While any functional promoter can be used to drive the transcription of 

20 mRNA from the nucleic acid molecule of the invention, the promoter is preferably a 
constitutive RNA polymerase II promoter (e.g., Rous Sarcoma Virus (RSV), 
cytomegalovirus (CMV), simian virus 40 (SV40), myeloproliferative sarcoma virus 
(MPSV), glucocorticoid, metallothionein, Herpes simplex virus thymidine kinase 
(HSVTK), human immune deficiency (HIV), mouse mammary tumor virus 

25 (MMTV), human polyomavirus BK (BKV), or Moloney murine leukemia virus 
(MuLV) promoter). Additional promoters suitable for use in the practice of the 
present invention are known in the art (see, e.g., Lee, A. et ai, Mol Cells. 7:495-501 
(1997)). 

The nucleic acid molecules of the invention may further comprise a fourth 
30 polynucleotide element, wherein said fourth polynucleotide element comprises a 
selection marker. The selection marker may facilitate the cloning and amplification 
of the vector sequences in prokaryotic and eukaryotic organisms. In certain 
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embodiments, the selection marker will confer resistance to a compound or class of 
compounds, such as an antibiotic. An exemplary selection marker that can be used 
with the nucleic acid molecules and expression systems of the present invention is 
one that confers resistance to puromycin. Alternatively, selection markers may be 
5 used that confer resistance to hygromycin, gpt, neomycin, zeocin, ouabain, 
blasticidin, or bleomycin, or markers such as DHRF, hisD, trpB, or glutamine 
synthetase. 

The pCytTs vector contains an ampicillin resistance marker for positive 
selection in bacterial host cells and an E. coli origin of replication {Le.^ ColEl). A 

10 considerable number of sequences encoding additional selection markers and origins 
of replication are known in the art {see, e,g., Sambrook, J. et al., eds.. Molecular 
Cloning, A Laboratory Manual, 2nd. edition. Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y. (1989); Ausubel, F. et al., eds., CURRENT 
Protocols in Molecular Biology, John H. Wiley & Sons, Inc. (1997)). 

15 The nucleic acid molecules and expression systems of the invention comprise 

a first nucleotide sequence comprising a first open reading frame, said first open 
reading frame having a nucleotide sequence encoding a replicase, e.g.^ an RNA- 
dependent RNA polymerase. The first nucleotide sequence may, in certain 
embodiments, also comprise 5' and 3' c/s'-acting sequences, and junction sequences 

20 containing a subgenomic promoter. 

The RNA-dependent RNA polymerases that can be used with the present 
invention may be of viral origin. The RNA-dependent RNA polymerase may be 
derived, e.g., from virus selected from the group consisting of: Bebaru virus, 
Cabassou virus, Chikungunya virus, Easter equine encephalomyelitis virus. Fort 

25 morgan virus, Getah virus, Kyzylagach virus, Mayoaro virus, Middleburg virus, 
Mucambo vims, Ndumu virus, Pixima virus, Tonate virus, Triniti virus, Una virus, 
Westem equine encephalomyelitis vims, Whataroa vims, Venezuelan equine 
encephalomyelitis virus (VEE), and Ross River vims. In a preferred embodiment, 
the RNA-dependent RNA polymerase of the invention is derived from an alphavims 

30 such as, e.g., a Sindbis virus, a Semliki Forest vims, and/or an Aura vims. 

In certain embodiments of the invention, the RNA-dependent RNA 
polymerase that is encoded by the first open reading frame of the nucleic acid 
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molecules of the invention will be a non-cytopathic RNA-dependent RNA 
polymerase. The RNA-dependent RNA polymerase may, alternatively, be a 
temperature sensitive RNA-dependent RNA polymerase. In other embodiments, the 
RNA-dependent RNA polymerase may be a non-cytopathic, temperature-sensitive 
5 RNA-dependent RNA polymerase. 

When the RNA-dependent RNA polymerase that is encoded by the first open 
reading fi-ame of the nucleic acid molecules of the invention is a temperature- 
sensitive RNA-dependent RNA polymerase, the RNA-dependent RNA polymerase 
may, e.g., have replicase activity at temperatures below 34°C and low or 
10 undetectable replicase activity at 34°C or above. In certain embodiments, the 
temperature sensitive RNA-dependent RNA polymerase will be one that has 
replicase activity at 34°C which is at least five fold lower than the replicase activity 
exhibited at 29''C. 

Preferred mutations which confer a non-cytopathic phenotype are those that 

15 are in the nsp2 gene (e.g., the proline residue at position 726 is replaced with a serine 
or a leucine residue). Mutations are known in the art which render the replicase 
protein non-cytopathic (Weiss et al, J. Virol ii:463-474 (1980); Dryga et al. 
Virology 228:74-S3 (1997); Agapov et al., PNAS PJ: 12989-12994 (1998)). In 
addition, mutations may be introduced into the nuclear localization signal of an 

20 alphaviral replicase which diminish toxicity of the replicon by, e.g.^ modifying the 
distribution of the replicase-encoding sequence between the cytoplasm and the 
nucleus. (Rikkonen a/., F/ro/agy 27(^:352-361 (1996)). 

Mutations which render a replicase non-cytopathic may be introduced by a 
nxmiber of means, including site directed mutagenesis. Additional mutations that 

25 render an RNA-dependent RNA polymerase non-cytopathic can be identified by, 
e.g., random mutagenesis and screening using methods and techniques that are well- 
known in the art. As will be imderstood by persons having ordinary skill in the art, 
the identification of additional mutations that render an RNA-dependent RNA 
polymerase non-cytopathic, temperature sensitive, or both, does not require an ability 

30 to predict the structural or functional consequences of any particular mutation. 

About 20 years ago Weiss et al. (Weiss, B. et al, J. Virol 55:463-474 
(1980)) established a persistently infected culture of BHK cells. The mutation 
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responsible for this phenotype has been recently identified (Dryga, S.A. et aL, 
Virology 225:74-83 (1997)). Another mutation allowing the regulation of the mRNA 
transcription via temperature shifts was identified by Burge and Pfefferkom (Burge, 
B.W. & Pfefferkom, E.R., Virology 50:203-214 (1966)) and described in more detail 
5 by Xiong et al (Xiong, C. et al. Science 2^5:1 188-1 191 (1989)). 

Temperature sensitivity {ts) may be conferred, for example, by the 
introduction of a mutation in the nsp4 gene of the replicase. Preferably, mutations 
which confer a temperature-sensitive phenotype upon replicase activities are in a 
protein in complementation group F (Lemm et al, J, Virol (^4:3001-3011 (1990)). 

10 For example, a temperature-sensitive phenotype may be conferred by changing Gly 
153 of nsp4 to Glu. Additionally, any other mutation which renders replicase 
activity temperature-sensitive can be used in the practice of the invention. Methods 
for creating and identifying new temperature-sensitive mutants are described by 
Pfefferkom (Burge and Pfefferkom, Virol 50:204-213(1966); Burge and 

15 Pfefferkom, Virol 50:214-223 (1966)). Other methods will be appreciated by those 
of ordinary skill in the art. 

While most temperature-sensitive mutants are "hot" sensitive, "cold" 
sensitive mutations are also known {see, e.g., Schwer, B. et al, Nucleic Acids Res. 
2(5:803-809 (1998), Mathe, E. et al, J. Cell ScL 7/7:887-896 (1998), Doedens, J. et 

20 al, J. Virol 77:9054-9064 (1997), Patterson, B. et al, J, Biol Chem. 272:27612- 
27617 (1997)). The temperature-sensitive replicase may be "cold" or "hot" sensitive 
and thus will catalyze RNA replication only at temperatures either above or below 
restrictive temperatures. In one embodiment, RNA replication occurs at detectable 
levels only at temperatures lower than 34*^0. In contrast to all previously known 

25 regulatable DNA expression systems, the basal level of expression in recombinant 
host cells containing the pCytTs vector in the inactive state at 37°C is below the level 
of detection using standard methods (e.g., those used in the following examples) 
even imder condition of transient transfection. This low level of expression is 
apparent from the data presented in Table 1 . 

30 The production of additional temperature sensitive and/or non-cytopathic 

replicases has been described in the art. (Lundstrom et al.. Gene Ther, Mol Biol 
^.23-31 (1999), Lundstrom et a/., Histochem, Cell BioL 775:83-91 (2001)). 
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Moreover, with respect to alphaviruses, the resuhs obtained in one t3^e of alphavirus 
can often be used to predict the results obtained in other alphaviruses. That is, the 
identification of mutations that render a replicase temperature-sensitive and/or non- 
cytopathic in one alphavirus type may be used to identify or create a similar 
5 temperature-sensitive and/or non-cytopathic replicase in a different alphavirus. This 
ability is related, in part, to the high level of sequence identity observed among 
alphavirus genomes, especially in their replicase-encoding sequences. (Weaver et aL, 
J, Virol 77:613-623 (1997), Smyth et aL, J. Virol, 77:818-823 (1997), Kuhn et a/., J, 
ViroL 70:7900-7909(1996)). 

10 In certain embodiments, the first nucleotide sequence (comprising a first open 

reading fi-ame encoding an RNA-dependent RNA polymerase) and the second 
nucleotide sequence (comprising, e.g., a second open reading frame encoding a 
polypeptide) are contained on two separate nucleic acid molecules. In such an 
instance, the second nucleotide sequence may carry both ci^-acting sequences and a 

15 5* region which inhibits translation of the sequence of interest. The first nucleotide 
sequence can also be encoded by a nucleic acid molecule which is different than the 
second nucleotide sequence. Replication and translation of the second nucleotide 
sequence in this multi-vectors system can be regulatable by temperature if a 
temperature sensitive RNA-dependent RNA polymerase is used. 

20 Additional mutations that render an RNA-dependent RNA polymerase non- 

cytopathic, temperature-sensitive, or both, can be identified by, e.g., random 
mutagenesis and screening using methods and techniques that are well-known in the 
art. As will be understood by persons having ordinary skill in the art, the 
identification of additional mutations that render an RNA-dependent RNA 

25 polymerase non-cytopathic, temperature sensitive, or both, does not require an ability 
to predict the structural or fimctional consequences of any particular mutation. The 
identification of an RNA-dependent RNA polymerase that is non-cytopathic, 
temperature-sensitive, or both, only involves, e.g., the generation of random 
mutations and the fimctional testing of the replicases that are encoded from the 

30 mutated nucleic acid molecules. Such screening methods are routine in the art. 

The nucleic acid molecules and expression systems of the invention can be 
also used to express of more than one gene of interest. For example, recombinant 
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host cells can be transfected with more than one nucleic acid molecule of the 
invention wherein one nucleic acid molecule encodes both the replicase and a 
polypeptide of interest and additional nucleic acid molecules may encode additional 
pol5^eptides of interest. Similarly, when mutations conferring non-cytopathicity and 
5 temperature sensitivity are both used, genes encoding polypeptides having suitable 
mutations (e.g.. Pro 726 Ser in nsp2 and Gly 153 Glu in nsp4) may be on separate 
nucleic acid molecules. Additional variations would be apparent to those skilled in 
the art. 

The nucleic acid molecules of the invention may also contain packaging signals 

10 which direct the packaging of RNA molecules into viral particles. These RNA 
molecules can be packaged in the presence of wdld-type vims or defective helper 
virus RNA. A significant improvement was made with the development of defective 
helper RNA molecules (Bredenbeek, P. et al, J. Virol 67:6439-6446 (1993)). These 
RNA molecules contain c/^-acting sequences, required for replication of the fuU- 

15 length transcription product, and subgenomic RNA promoter sequences which drive 
the expression of the structural protein genes. For example, in cells containing both 
RNA molecules with packaging signals and the defective helper virus RNA, 
alphaviral non-structural proteins allow for replication and amplification of the 
defective helper virus RNA sequences which are translated to produce virion 

20 structural proteins. Since the helper virus RNA lacks packaging signals, these 
molecules are not packaged into assembled virions. Thus, virion particles produced 
in this way contain essentially only RNA sequences encoding the gene of interest 
and, generally, other sequences required for temperature-sensitive regulation of gene 
expression. These non-infective packaged RNA molecules do not contain sequences 

25 encoding virion structural proteins and, thus, xmdergo only one round of host cell 
infection and are not pathogenic. 

Non-infective packaged RNA molecules can be used to infect a culture of 
suitable host cells simply by adding the particles to culture medium containing these 
cells. The preparation of non-infective alpahviral particles is described in a number 

30 of sources, including "Sindbis Expression System", Version C, (Invitrogen Catalog 
No. K750-1). 
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One exemplary application of this system is directed to the temperature- 
dependent production of non-infective, packaged RNA molecules. These packaged 
RNA molecules may be produced by a number of means including using 
recombinant host cells containing two different nucleic acid molecules (e.g., a 
5 nucleic acid molecule of the invention and a nucleic acid molecule encoding a helper 
virus RNA sequence). For example, one of these nucleic acid molecules will encode, 
e.g., an RNA molecule which contains packaging signal sequences, sequences 
encoding a non-cytopathic, temperature-sensitive replicase, and the gene of interest. 
The other nucleic acid molecule will contain sequences encoding alphaviral 

10 structural proteins downstream from an alphavirus subgenomic promoter. Using 
such a system, viral particles containing only RNA molecules with packaging signals 
will be produced at permissive temperatures in recombinant host cells. This is so 
because alphaviral structural proteins will only be produced at a permissive 
temperature. Additional variations of the above would be apparent to one skilled in 

15 the art. 

The second nucleotide sequence that is included within the nucleic acid 
molecules and expression systems of the invention may be, e.g., a second open 
reading frame encoding a polypeptide. Such a second open reading frame may, 
alternatively or interchangeably, be referred to as a nucleotide sequence of interest. A 

20 wide variety of nucleotide sequences of interest can be expressed by the nucleic acid 
molecules and expression systems of the invention. These sequences include, but are 
not limited to, sequences encoding lymphokines, cytokines, toxins, enzymes, prodrug 
converting enzymes, antigens which stimulate immvme responses, single chain 
antibodies, polypeptides which stimulate or inhibit immune responses, tumor 

25 necrosis factors, and various proteins with therapeutic applications (e.g., growth 
hormones and regulatory factors). 

In certain embodiments, the second open reading frame (the second 
nucleotide sequence) of the invention may encode, e.g., a cytokine or lymphokine 
(e.g., p-interferon). Hematopoiesis is regulated by lymphokines and cytokines which 

30 stimulate the proliferation and/or differentiation of various hemopoietic cells. 
Representative examples of cytokines and lymphokines include interleukin- 1 (IL-1), 
interleukin-2 (IL-2), interleukin-3 (IL-3), interleukin-4 (IL-4), interleukin-5 (IL-5), 
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interleukin-6 (IL-6), interleukin-7 (IL-7), interleukin-8 (IL-8), interleukin-9 (IL-9), 
interleukin-lO (IL-10), interleukin-l 1 (IL-11), interleukin-12 (IL-12), interleukin-13 
(IL-13), interleukin-14 (IL-14), interleukin-l 5 (IL-15), interleukin-l 6 (IL-16), 
interleukin-l 7 (IL-17), granulocyte colony stimulating factor (G-CSF), granulocj^e- 
5 macrophage colony stimulating factor (GM-CSF), macrophage colony stimulating 
factor (M-CSF), and interferons. 

In certain embodiments, the second open reading frame (the second 
nucleotide sequence) of the invention may encode secreted enzjones (e.g., secreted 
alkaline phosphatase), cytoplasmic enzymes (e.g., green fluorescent protein), or any 

10 number of other proteins with therapeutic applications (e.g., human insulin, human 
coagulation Factor VIII). 

The nucleic acid molecules and expression systems of the invention can also 
be used to express heterologous sequence encoding cytotoxic polypeptides. 
Cytotoxic polypeptides act to directly or indirectly inhibit cellular growth or 

15 metabolism. Representative examples of toxins include Shigella toxin, ricin. 
Diphtheria toxin, Cholera toxin, Pseudomonas exotoxin A, and Herpes simplex virus 
thymidine kinase (HSVTK). Within other embodiments of this invention, the 
heterologous sequence encodes a prodrug converting enzyme. A prodrug converting 
enzyme activates a compound with little or no cytotoxicity into a toxic product. 

20 Representative examples are HSVTK, alkaline phosphatase, guanine phosphoribosyl 
transferase, and penicillin-V amidase. Examples of both cytotoxic polypeptides and 
prodrug converting enzymes are discussed in numerous sources including 
PCT/US97/06010, EP 0716148, and WO 96/17072. In addition, a vast array of 
signaling molecules and membrane proteins 2ire toxic if expressed at high levels. All 

25 these molecules may be suitable for expression using the system of the present 
invention. 

The second nucleotide sequence of the nucleic acid molecules and expression 
systems of the invention may also be a nucleotide sequence encoding an untranslated 
RNA molecule or complement thereof. Exemplary untranslated RNA molecules 
30 include, e.g., antisense sequences, RNase P targeted sequences which induce gene 
dovm-regulation, and ribozymes. Smith S. et al {J. Virol 77:9713-9721 (1997)) 
describes alphaviral vectors used to express ribozyme sequences. 
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The nucleic acid molecules and expression systems of the invention can also 
be used to express virtually any polypeptide, including ones which have not as yet 
been identified but are encoded by nucleotide sequences contained in, for example, 
cDNA libraries or host cell chromosomes. Example of such polypeptides include 
5 secreted proteins and proteins Jfrom various cellular compartments. Heterologous 
sequences expressed by the vectors of the invention can encode polypeptides and 
RNA molecules from non-human species (e.g., other mammals, plants, fungi, 
bacteria or viruses). These heterologous sequences may further encode viral 
membrane proteins (e.g., HIV gpl60) or viral polyproteins (e.g,, Sindbis structural 
10 proteins). 

Nucleotide sequences may be added to the nucleic acid molecules and vectors 
of the invention which result in the production of a fusion protein. For example, 
such sequences can encode amino acids sequences which are fused to a protein 
encoded by a gene of interest and confer one or more functional characteristics upon 

15 the expression product. These amino acid sequences include sequences which will 
target the gene product for export from the cell (e.g.^ a secretory sequence) or to a 
subcellular compartment (e,g.^ the nucleus). Such amino acid sequences further 
include sequences which facilitate purification (e.g., a six His "tag"). Depending on 
the amino acid sequence and the function imparted by the fused sequence, the added 

20 amino acid sequences may or may not be cleaved from the translation product. 

Fusion proteins also include proteins which have domains or regions derived 
from various different proteins. Examples of such a fusion protein are those 
containing domain II of Pseudomonas exotoxin, a domain or amino acid sequence 
which has binding affinity for a cell surface receptor associated with a particular cell 

25 type, and another amino acid sequence with a preselected biological activity. 
Domain II of Pseudomonas exotoxin will translocate across cell membranes. Using 
this system, fusion proteins can be designed which will bind to specific cells types, 
will translocate across the cytoplasmic membranes of these cells, and will catalyze 
predetermined intracellular biological reactions. A system of this type is described in 

30 Pastan et al, U.S. Patent No. 5,705,163. Methods for identifying amino acid 
sequences, which bind to specific cell types are described in Wu, A., Nature Biotech. 
7^:429-431 (1996). 
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The nucleic acid molecules and expression systems of the invention may, in 
certain embodiments, further comprise genetic elements which confer additional 
functional characteristics such as selection markers. 

Markers for the selection of prokaryotic and eukaryotic cells containing 
5 vectors the present invention are well known in the art and include puromycin, 
tetracycline, ampicillin, neomycin, hygromycin, gpt, zeocin, ouabain, blasticidin, 
bleomycin, and kanamycin resistance. Alternatively, selection markers such as 
DHRF, hisD, trpB, or glutamine synthetase may be used. Nucleotide sequences 
which result in high copy number amplification are also known in the art and include 
10 the ColEl sequence contained in the pCytTs vector. 



RECOMBINANT HOST CELLS 

15 The invention includes methods of making recombinant host cells and 

recombinant host cells produced using the methods of the invention. For example, 
the methods of the invention comprise introducing one or more nucleic acid 
molecules or expression systems described herein into a host cell. A variety of 
different recombinant host cells can be produced which contain the nucleic acid 

20 molecules and expression systems of the invention, Alphaviruses, for example, are 
known to have a wide host range. Sindbis virus, for example, infects cultured 
mammalian, reptilian, and amphibian cells, as well as some insect cells (Clark, H., J, 
Natl Cancer Inst. 57:645 (1973); Leake, C, J. Gen, Virol 55:335 (1977); Stollar, V. 
in The TOGAVIRUSES, R.W. Schlesinger, Ed., Academic Press, (1980), pp.583-621). 

25 Thus, numerous host cells can be used in the practice of the invention. 

Representative host cells that may be used with the invention include, but are not 
limited to, bacterial cells, yeast cells, plant cells and animal cells. Preferred bacterial 
host cells include Escherichia spp, cells (particularly E. coli cells and most 
particularly E. coli strains DHIOB, Stbl2, DH5, DB3, DB3.1, DB4 and DB5), 

30 Bacillus spp. cells (particularly B. subtilis and B. megaterium cells), Streptomyces 
spp. cells, Erwinia spp. cells, Klebsiella spp. cells, Serratia spp. cells (particularly S. 
marcessans cells), Pseudomonas spp. cells (particularly P. aeruginosa cells), and 
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Salmonella spp. cells (particularly S. typhimurium and S. typhi cells). Preferred 
animal host cells include insect cells (most particularly Drosophila melanogaster 
cells, Spodoptera finigiperda Sf9 and Sf21 cells and Trichoplusa High-Five cells), 
nematode cells (particularly C. elegans cells), avian cells, amphibian cells 
5 (particularly Xenopus laevis cells), reptilian cells, and mammalian cells (most 
particularly Nffl3T3, CHO, COS, VERO, BHK, HEK, other rodent cells, and human 
cells). Preferred yeast host cells include Saccharomyces cerevisiae cells and Pichia 
pastoris cells. BHK, COS, Vero, HeLa and CHO cells are particularly suitable for 
the production of heterologous polypeptides because they have the potential to 

10 glycosylate heterologous proteins in a manner similar to human cells (Watson, E. et 
al, Glycobiology 4:227, (1994)) and can be selected (Zang, M. et al, 
Bio/Technology 75:389 (1995)) or genetically engineered (Renner W. et ai, Biotech. 
Bioeng, 47A16 (1995); Lee K. et al Biotech Bioeng. 50:336 (1996)) to grow in 
serum-free medium, as well as in suspension. 

15 Example 2 shows that the expression system of the present invention can 

function in BHK cells and 293HEK cells. Moreover, we have previously 
demonstrated that the pCytTs system works efficiently in additional cells lines such 
as CHO-Kl and COS-7 (Boorsma et al. Nature Biotech, 75:429-432 (2000)). 

The nucleic acid molecules and/or expression systems of the invention may 

20 be introduced into host cells using well known techniques of infection, transduction, 
electroporation, transfection, and transfomiation. Exemplary methods include 
DEAE-dextran mediated transfection, transient transfection, microinjection, cationic 
lipid-mediated transfection, scrape loading and ballistic introduction. Methods for 
the introduction of exogenous DNA sequences into host cells are discussed in 

25 Feigner, P. et al, U.S. Patent No. 5,580,859. The nucleic acid molecules and/or 
vectors of the invention may be introduced alone or in conjimction with other nucleic 
acid molecules and/or vectors and/or proteins, peptides or RNAs. Alternatively, the 
nucleic acid molecules and/or expression systems of the invention may be introduced 
into host cells as a precipitate, such as a calciimi phosphate precipitate, or in a 

30 complex with a lipid. Electroporation also may be used to introduce the nucleic acid 
molecules and/or expression systems of the invention into a host. Likewise, such 
molecules may be introduced into chemically competent cells such as E. coli. If the 
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vector is a virus, it may be packaged in vitro or introduced into a packaging cell and 
the packaged virus may be transduced into cells. Hence, a wide variety of techniques 
suitable for introducing the nucleic acid molecules and/or vectors of the invention 
into host cells are well known and routine to those of skill in the art. Such 
5 techniques are reviewed at length, for example, in Sambrook, J., et al.. Molecular 
Cloning, a Laboratory Manual, 2nd Ed., Cold Spring Harbor, NY: Cold Spring 
Harbor Laboratory Press, pp. 16.30-16.55 (1989), Ausubel, F. et al, eds.. Current 
Protocols in Molecular Biology, John H. Wiley & Sons, Inc. (1997), Chapter 
16), Watson, J.D., et al.. Recombinant DNA, 2nd Ed,, New York: W.H. Freeman and 

10 Co., pp. 213-234 (1992), and Winnacker, E.-L., From Genes to Clones, New York: 
VCH Publishers (1987), which are illustrative of the many laboratory manuals that 
detail these techniques and which are incorporated by reference herein in their 
entireties for their relevant disclosures. 

The invention is also directed to recombinant host cells comprising one or 

15 more nucleic acid molecules or expression systems of the invention. Also included 
within the present invention are in vitro cell cultures that comprise such recombinant 
host cells. Methods for producing recombinant host cells and cell cultures 
comprising the same are well-known in the art. 

20 

PRODUCTION OF POLYPEPTIDES AND RNA MOLECULES 

The nucleic acid molecules, expression systems and recombinant host cells of 
the invention may be used for the production of proteins, polypeptides and RNA 

25 molecules, e.g., untranslated RNA molecules. The methods of the invention may 
comprise, e.g., introducing one or more nucleic acid molecules or expression systems 
of the present invention into host cells to produce recombinant host cells, culturing 
the recombinant host cells imder conditions suitable for expression of the polypeptide 
or untranslated RNA molecule, and recovering the polypeptide or untranslated RNA 

30 molecule. 

The invention also provides methods for regulating the expression of a 
polypeptide or an untranslated RNA molecule. The methods of the invention may 



comprise, e.g,, (a) introducing one or more nucleic acid molecules or expression 
systems of the invention into a host cell to produce a recombinant host cell, wherein 
said nucleic acid molecules or expression systems comprise a first open reading 
frame having a nucleotide sequence encoding a temperature-sensitive RNA- 
5 dependent RNA polymerase, (b) growing said recombinant host cell under suitable 
culture conditions, and (c) changing the temperature of the recombinant host cell 
culture from: (i) a permissive temperature to a restrictive temperature, or (ii) a 
restrictive temperature to a permissive temperature. 

The present invention also provides methods for producing polypeptides and 

10 RNA molecules, said methods comprising introducing one or more nucleic acid 
molecules or expression systems of the invention into recombinant host cells, 
expanding the cells at a non-permissive temperature and incubating these cells at a 
permissive temperature. In a related aspect, the invention provides purified 
polypeptides and RNA molecules produced according to the methods of the present 

15 invention. 

Depending on the molecule, which is expressed, it may be obtained either 
from the culture supernatant or by lysing the recombinant host cells. 

Polypeptides produced using the nucleic acid molecules and expression 
systems of the invention can be recovered and purified from recombinant cell 

20 cultures by methods known in the art including ammonium sulfate precipitation, 
anion or cation exchange chromatography, phosphocellulose chromatography, 
hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite 
chromatography, and high performance liquid chromatography. Methods for 
purifying proteins are described in nxmierous sources (see, e.g., Celis, J., ed.. Cell 

25 Biology, Academic Press, 2""* edition, (1998)). 

Untranslated RNA molecules produced using the nucleic acid molecules and 
expression systems of the invention can be recovered and purified from recombinant 
cell cultures by methods known in the art (see, e.^., Celis, J., ed.. Cell Biology, 
Academic Press, 2"^ edition, (1998)). Methods for recovering and/or purifying RNA 

30 molecules include phenol/chloroform extraction, digestion with DNAses followed by 
precipitation of the undigested RNA molecules, and column chromatography (e.g,, 
oligo dT column chromatography). Further, RNA molecules can be separated from 



other cellular material using the single-step guanidinium-thiocyanate-phenol- 
chloroform method described in Chomczynski and Sacchi, Anal. Biochem, 
752:156-159(1987). 

The overall cell culture process employing nucleic acid molecules and 
5 expression systems of the invention for the production of polypeptides and/or 
untranslated RNA molecules can be implemented in a variety of bioreactor 
configurations (e.g., stirred-tank, perfused, membrane enclosed, encapsulated cell, 
fluidized bed, and air-lift reactors) and scales (from laboratory T-flasks to thousands 
of liters), chosen to accommodate the requirements of the host cell line utilized {e.g., 

10 anchorage dependency, O2 concentrations), to maximize the production of expression 
product, and to facilitate subsequent recovery and purification of expression product. 

The invention is also directed to the production of polypeptides or RNA 
molecules of interest using mammalian cells grown in serum-firee or protein-firee 
culture media. For example, by long-term culture under conditions restricting serum 

1 5 access or selecting for suspension growth, CHO cell lines are selected which are able 
to grow in serum-free mediimi and/or in suspension (Zang. M. et al, Bio/Technology 
7i:389(1995)). 

Further, a number of different bioprocess parameters can be varied in order to 
alter the glycosylation pattem of polypeptide products produced by the recombinant 
20 host cells of the invention. These factors include medium composition, pH, oxygen 
concentration, lack or presence of agitation, and, for the case of anchorage-dependent 
cells, the surface provided. Thus, the glycosylation pattem of glycoproteins may be 
altered by choosing the host cell in which these proteins are expressed in and the 
conditions under which the recombinant host cells are grown. 

25 

PHARMACEUTICAL COMPOSITIONS 

The invention further provides pharmaceutical compositions comprising 
30 nucleic acid molecules and/or expression systems and/or recombinant host cells of 
the invention. The pharmaceutical compositions of the invention may comprise 
nucleic acid molecules and/or expression systems and/or recombinant host cells of 
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the invention in combination {e.g,^ in solution) with a physiologically acceptable 
carrier and in a therapeutically effective amount. The administration of these 
pharmaceutical compositions may, for example, result in expression of a polypeptide 
in tissues of an animal which is immunogenic and intended to function as a 
5 vaccination. Similarly, the nucleic acid molecules and/or expression systems and/or 
recombinant host cells of the invention may carry sequences that encode 
polypeptides or RNA molecules required for the treatment of an active affliction. 
The administration of a pharmaceutical composition of the invention will thus be 
intended to have a therapeutic effect in these instances. 

10 The nucleic acid molecules and/or expression systems and/or recombinant 

host cells of the invention will normally be administered to an individual in a 
pharmacologically acceptable carrier. A composition is said to be 
"pharmacologically acceptable" if its administration can be tolerated by a recipient 
individual. Further, the composition of the invention will be administered in a 

15 "therapeutically effective amount" (e.g., an amount that produces a desired 
physiological effect). 

As would be imderstood by one of ordinary skill in the art, when the nucleic 
acid molecules and/or expression systems and/or recombinant host cells of the 
invention are administered to an individual, they may be in a composition which 

20 contains salts, buffers, adjuvants, or other substances which are desirable for 
improving the efficacy of the composition. Examples of materials suitable for use in 
preparing pharmaceutical compositions are provided in numerous sources including 
Remington*s Pharmaceutical Sciences (Osol, A, ed.. Mack Publishing Co., 
(1980)). 

25 The pharmaceutical compositions of the present invention can be 

administered by various art known means but will normally be administered by 
injection, infusion or other suitable physical methods. The compositions may 
alternatively be administered intramuscularly, intravenously, or subcutaneously. 
Components of compositions for administration include sterile aqueous (e.g., 

30 physiological saline) or non-aqueous solutions and suspensions. Examples of non- 
aqueous solvents are propylene glycol, polyethylene glycol, vegetable oils such as 
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olive oil, and injectable organic esters such as ethyl oleate. Carriers or occlusive 
dressings can be used to increase skin permeability and enhance antigen absorption. 

When recombinant host cells are administered to an individual, the number of 
cells or nucleic acid molecules required to provide a therapeutically effective amoxmt 
5 will vary with such factors as the individual's condition, the polypeptides or RNA 
molecules intended to be expressed, and the size of the individual. 



KITS 

10 

The invention also provides kits comprising the isolated nucleic acid 
molecules, expression systems and/or recombinant host cells of the invention. The 
kits of the invention may optionally comprise one or more additional components 
selected from the group consisting of one or more containers (e.g., boxes, vials, 
15 tubes, jars ampules, etc.) one or more vectors, one or more nucleotides, one or more 
primers, one or more polypeptides having polymerase activity, one or more host cells 
(e.g., host cells that may be competent for uptake of nucleic acid molecules), and one 
or more buffers. 

It will be readily apparent to one of ordinary skill in the relevant arts that 
20 other suitable modifications and adaptations to the methods and applications 
described herein are obvious and may be made without departing from the scope of 
the invention or any embodiment thereof. Having now described the present 
invention in detail, the same will be more clearly imderstood by reference to the 
following examples, which are included herewith for purposes of illustration only 
25 and are not intended to be limiting of the invention. 
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EXAMPLE 1 

CONSTRUCTION OF THE pCytTs VECTOR SYSTEM 

Manipulations and sequencing of DNA were carried out by standard 
5 procedures. The mutations in nsP2 were introduced by PCR using the following 
oligonucleotides : 

oligo-nsp2 1: 5'-AACATTGA AATCGAT ATTACAGGGG (SEQ ID NO:3), 
oligo-nsp2 2: 5'-CGGGTTAT GGTCGAC CGGGC (SEQ ID NO:4), 
oligo-nsp2 3:5'-GTGCCCTCCCCTGA GTTTAAA CAATTCAGGGCCGAA 
1 0 CGCG (SEQ ID NO:5), and 

oligo-nsp2 4: 5'-GAATTG TTTAAA CTCAGGAGGCACCCTCGTGG (SEQ 
ID NO:6). 

The single restriction sites used for first analysis and subsequent cloning 
(Dral, Clal and Sail) are underlined. PCR reactions were performed using either 

15 oligo-nsp2 1 (SEQ ID NO:3) and oligo-nsp2 3 (SEQ ID NO:5) or oUgo-nsp2 2 (SEQ 
ID NO:4) and oligo-nsp2 4 (SEQ ID NO:6). 100 pmol of each oligo was used and 5 
ng of the template DNA (pSinRepS; Xiong, C. et al. Science 243:1188-1191 (1989)) 
was used in the 100 Q>\ reaction mixture, containing 4 units of Taq or Pwo 
polymerase, 0.1 mM dNTPs and 1.5 mM MgS04. All DNA concentrations were 

20 determined photometrically using the GeneQuant apparatus (Pharmacia Biotech Inc., 
800 Centeimial Ave., Piscataway, NJ. 08854). The polymerase was added directly 
before starting the PCR reaction (starting point was 95'^C). The temperature cycles 
were as follows: 95^C for 2 minutes, followed by 5 cycles of 95°C (45 seconds), 
SS'^C (30 seconds), 72°C (90 seconds) and followed by 25 cycles of 95^C (45 

25 seconds), 68°C (30 seconds), 72^C (90 seconds). 

The two PCR fragment were purified using Qia spin PCR kit (QIAGEN, Inc., 
9259 Eton Avenue, Chatsworth, CA, 91311) and finally digested in an appropriate 
buffer using 20 units of Sail and Z)ral, respectively 20 units of Clal and Dral. The 
digestion was performed for 12 hours at 37®C. The DNA fragments were gel- 
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purified (Gene-Clean; Bio 101 Inc., 1070 Joshua Way, Vista, CA, 92083, USA) and 
finally ligated into ClaVSaR digested and gel-purified SinRepS vector (Xiong, C. et 
al. Science 2^i:l 188-1 191 (1989). The correct sequence of the obtained vector was 
checked by DNA sequencing of the whole nsP2 gene. 
5 The mutations in nsP4 were also introduced by PGR using the following 

oligonucleotides: 

oligo-nsp4 1: 5'-GGTAGAGGAGACAGTCGCATGCCTGGATAC (SEQ ID 

NO:7), 

oligo-nsp4 2: 5^-GTATGGA GGGATGC GACTGTGTCGTCTAGG (SEQ ID 

10 NO:8), 

oligo-nsp4 3: 5'-GAGAGGGGTTAACGGGATAGGG TGG (SEQ ID NO:9), 

and 

oligo-nsp4 4: 5'-GTGTAT TAGTAGT ATGGAGAGTTGG (SEQ ID NO: 10), 

The singular restriction sites used for the first analysis and the final cloning 
15 step (Sphl^ Hpal and Spel) are underlined. Two PGR reactions were carried out as 
described above using either oligo-nsp4 1 (SEQ ID NO: 7) and oligo-nsp4 3 (SEQ ID 
NO:9) or oligo-nsp4 2 (SEQ ID NO:8) and oUgo-nsp4 4 (SEQ ID NO: 10). 

Both PGR products were gel-purified and then used in assembly PGR to 
amplify the whole nsP4 gene. For the assembly PGR, 50 pmol of the outer primers 
20 (3 and 4) and about 10 ng of each PGR fragment was used. The reaction volume was 
100 containing 4 units of Taq or Pwo polymerase, 0.1 mM dNTPs and 1.5 mM 
MgS04 . The PGR conditions were as followed: 

95°G for 2 minutes, followed by 5 cycles of 92°G (45 seconds), 58^G (30 
seconds), 72*'G (120 seconds) and followed by 25 cycles of 92^G (45 seconds), 64^G 
25 (30 seconds), 72*^0 (120 seconds). 

The obtained PGR fragment was purified as described above and the eluate 
was digested with 20 units of Spel and Hpal in an appropriate buffer. The fragment 
was gel-purified and ligated into gel-purified SpeVHpal restricted SinRep5 vector. 
The correct sequence of the obtained vector was checked by DNA sequencing. 
30 Ovemight digestion of SinRep5-nsP4mut and SinRep5-nsp2mut with 

SpeVHpal and gel purification of the nsp4 fragment and sinRep-nsp2mut vector. 
The nsp4mut fragment was ligated into the SinRep5-nsp2mut vector. The final step 
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was cloning the nsp gene into the 987/SinRep5 vector (Bredenbeek, P. et al, J. Virol. 
57:6439-6446 (1993)) using Clal and Hpal as restriction endonucleases, the resulting 
vector was named pCytTs (SEQ ID NO:l). 

5 Vector pCytTS2.1 (SEQ ID NO:2,) contams a different multiple cloning site 

than vector pCytTS, the RSV promoter is replaced by the CMV promoter, an SV40 
intron sequence is included in the replicon and a puromycin resistance marker under 
control of the SV40 promoter is integrated on the plasmid. These elements can be 
cloned into pCytTS (SEQ ID NO: 1) in the following way: 

10 Oligonucleotides Cyt-Link-FOR 5'- CTAGATTAATTAACTCGAG 

GCGCGCCG-3' (SEQ ID NO: 11) and Cyt-Link-Rev 5'- 
GGCCCGGCGCGCCTCGAGTTAATTAAT-3' (SEQ ID NO:12) are hybridized 
and ligated into XbaVBspl20l digested vector pCytTS. This step introduces a 
different multiple cloning site into pCytTS (Intermediatel). 

15 The CMV promoter is introduced by assembled PGR using the following 

oligonucleotides : 

5'CMV: 5'-ATAAGAATGCCGGCGGATCCGGCCATTAGC-3' (SEQ ID 

No: 13) 

3 'SinCMV: 5 '-CCGTCAATACGGTTCACTAAACGAGCTCTG 

20 CTTATATAGACC-3' (SEQ ID No: 14) 

5 'CMVSin: 5 '-GCTCGTTTAGTGAACCGTATTGACGGCGTA 

GTACACAC-3' (SEQ ID No: 15) 

3'Sin: 5'- ACGTCGGCCTCAATTTCGCG-3' (SEQ ID No: 16) 
The singular restriction site used for the final cloning step {NgoAYW) is 
25 underlined. Two PGR reactions are carried out as described above using either 
oligonucleotide 5'CMV 1 (SEQ ID NO:13) and 3'SinCMV (SEQ ID NO:14) or 
5'CMVSin (SEQ ID NO:15) and 3'Sin (SEQ ID NO:16). As template DNA vector 
pLNCX (Miller et al., 1989, Biotechniques, 7, 980-982) and vector pCytTS can be 
used, respectively. Both PGR products are gel-purified and then used in eissembly 
30 PGR. For the assembly PGR the outer primers (5'CMV and 3'Sin) are used, 
resulting in an 3089 bp size PGR fi-agment. The obtained PGR fragment is purified as 
described above and the eluate is digested with NgoAW and Eco47\\l in an 
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appropriate buffer. The fragment is gel-purified and ligated into gel-purified 
NgaAIV/Eco47lll restricted IntermediateL This cloning step replaces the RSV 
promoter in pCytTS with the CMV promoter (Intermediate!!). 

The SV40 intron is amplified by PCR from vector pcDNAl.lamp 
5 (!nvitrogen) using the following oligonucleotides: 

S V!ntron-FOR: 5 '-GCGCGCGGGCCCAGAGGATCTTTGTGA AGG-3 ' 
(SEQ !D NO: 17) 

SVIntron-REV: 5'-GCGCGCGCGGCCGCTACATCAAATATTT TTCC-3' 
(SEQ !D NO: 18) 

10 The PCR conditions can be choosen as followed: 94°C for 2 minutes, 

followed by 30 cycles of 94°C (45 seconds), 52°C (30 seconds), 72°C (60 seconds) 
and followed by a 7 min elongation step at 72°C. The obtained PCR fragment is 
purified using the Qia spin PCR kit (Q!AGEN, Inc., 9259 Eton Avenue, Chatsworth, 
CA, 91311). Finally the purified PCR fragment is digested in the corresponding 

15 buffer with Bspl20llNotl and ligated into Bspl20l digested Intermediate!!. The 
orientation of the insert is controlled by restriction enzyme analysis (Intermediate!!!). 
The SV40 intron is thus introduced behind the multiple cloning site. 

To introduce the puromycin resistance marker, vector pPUR 
(Clontech Laboratories, Inc., Cat. No. 6156-1) is modified first. Vector pPUR is 

20 digested with Xbal and the lineraized vector is treated with Klenow polymerase in 
order to get blimt ends. Religation of the vector destroyes the Xbal site. The resulting 
vector serves as DNA template in the PCR amplification of the puromycin resistance 
marker. The following oligonucleotides are used: 

5'SV40PUR: 5'- ACG TACGCGT GCGGCCGCGTTAGGGTGTGG 

25 AAAGTCCCC-3' (SEQ ID NO:19) 

3'SV40PUR: 5'- ACGTACGCGTTGGACAAACCACAACTAG AATGC-3' 
(SEQ ID NO:20) 

The singular restriction site used for the cloning step (Mlul) is 
underlined. PCR conditions can be choosen as described before with an annealing 
30 temperature at 45^C. The resulting PCR fragment is purified as described above, 
digested with Mlul and ligated into Adlul digested Intermediate!!!. The resulting 



^9- 



vector pCytTs2,l (SEQ ID NO:2,) contains the puromycin resistance marker under 
control of the SV40 promoter. 

EXAMPLE 2 

COMBINATION OF THE TEMPERATURE SENSITIVE pCytTs SYSTEM WITH 
5 THE SELF REPLICATING EBNA SYSTEM 

This system was generated in order to rapidly generate cell populations, 
which inducibly express a gene of interest. In this system, the tightly regulated 
PC34TS system is combined with the episomally replicating EBV system. 

10 A. Construction of various pCytTs constructs containing either the EBNA origin 
of replication (OriP) alone or in conjunction with the replication initiation factor 
EBNAL 

In order to be able to subclone either the origin of replication of Epstein-Barr 

15 virus alone or together with the replication initiation factor EBNAl we first 
subcloned this two cassettes into a shuttle vector from which they can easily be 
transferred into our pCytTs system with or without various inserts. Combinatorial 
constructs with pCj^Ts mentioned in the Examples such as pCytTs-OriP, pCytTs- 
SEAP-OPE, pCytTs-SEAP or the like generally contain a CMV promoter. The 

20 sequence of the pCytTs v^th the CMV promoter is given in SEQ ID NO: 2 
(pCytTs2.1; see Example 1). For sake of simplicity, however, the mentioned 
combinatorial constructs are abbreviated by simply stating pCytTs. The Epstein-Bzur 
virus origin of replication OriP was amplified by PCR from pCEP4 (Invitrogen) 
using primers PH 51 and PH 52. The forward primer (PH 51) corresponds to 

25 nucleotides 261-280 of pCep4 and the reverse primer (PH 52) to nucleotides 2221- 
2239 of pCep4. PCR was performed using pfx polymerase (BRL) according to 
manufacturer's recommendations. The resulting PCR product (nucleotides 261-2239 
of pCep4) was then cloned into pGEMTeasy vector (Invitrogen), which had 
previously been digested with Eco RI and treated with Klenow poljmierase in order 

30 to get blimt ends. The resulting plasmid was termed pGemT-OriP. In order to get a 
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subclonable cassette containing OriP and EBNAl gene, pCep4 was digested Avith 
Dra III and Cla L The resulting 465 1 bp fragment was purified and blunt ends were 
generated by T4 DNA polymerase treatment. This fragment was then subcloned into 
pGEMTeasy vector (Invitrogen), which had previously been digested with Eco RI 
5 and treated with BGenow polymerase in order to get blunt ends. The resulting 
construct was termed pGemT-OPE. From these plasmids either the origin of 
replication alone (pGemT-OriP) or together with the replication initiation factor 
(pGemT-OPE) could be excised by Not I digestion and transferred into Not I sites of 
the various CytTs vectors (e.g, pCytTs vectors containing the RSV or CMV 

10 promoter (SEQ ID NO: 1 or SEQ ID NO: 2) which resulted in the vectors pCytTs- 
OriP (SEQ ID NO: 21), and pCytTs-OPE (SEQ ID NO: 22). pCytTs-IFNp, pCytTs- 
EGFP, pCytTs-SEAP and pCytTs-cEPO vectors were linearized with Not I and the 
Not I fragment of pGemT-OPE was introduced by ligation leading to following 
constructs pCytTs-IFNp-OPE pCytTs-EGFP-OPE (Fig. 1), pCytTs-SEAP-OPE and 

15 pCytTs-EPO-OPE. Similarly pCytTs-EPO-OriP and pCytTs-EGFP-OriP were 
generated by cloning the Not I fragment from pGemT-OriP into the Not I site of 
pCytTs-cEPO and pCytTs-EGFP respectively. 

Initial experiments were performed with a construct in which a slightly 
truncated form of OriP had been subcloned into pCytTs-EGFP. pCep4 (Invitrogen) 

20 was digested with Dra III and Nsi I and the 1923 nucleotide fragment (position 200- 
2124 from pCep4) was then treated with T4 polymerase in order to get blunt ends. 
The blunted fragment was then subcloned into pCytTs-EGFP, which had previously 
been digested with Not I and blunted with T4 DNA polymerase. The resulting 
plasmid was termed pCytTs-EGFP-OriPd. 

25 

B. Analysis of pCytTs-EGFP-OriPd in 293 EBNA cells 

We first analyzed whether the introduction of Epstein-Barr virus derived 
origin of replication (OriP) into the pCytTs system would improve the inducible 
30 expression of a gene of interest on selected cell populations. We therefore transfected 
293-EBNA cells (Invitrogen) with pCytTs-EGFP-OriPd. These cells constitutively 
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express Epstein-Barr virus replication initiation factor EBNAl and support OriP 
mediated DNA replication. 293-EBNA cells were transfected with pCytTs-EGFP- 
OriPd using Lipofectamine 2000 (BRL) according to the manufacturer's 
reconraiendation. 36h after transfection cells were split 1 to 3 in presence of l|ig/ml 
5 puromycin. After 1 to 2 days all non-transfected cells had detached and the resistant 
cells were further expanded under puromycin selection for another week. In order to 
asses the expression fi'om the new vector, resistant cell populations were passed to 
two plates of which one was kept at 37°C (uninduced) and one was shifted to 29°C 
for induction of EGFP expression. 24h after induction cells were harvested by 

10 trypsinisation and analysed by flow cytometry. The results are shown in Table 1. 
Upon induction cell populations carrying pCytTs-EGFP-OriPd were to 80% EGFP 
positive and displayed strong EGFP expression. In contrast cells, which had not been 
induced showed only weak EGFP expression and only about 8% of the cells, were 
green. Considering that stable cell populations containing CytTs-EGFP would only 

1 5 yield about 2-5% EGFP positive cells upon induction (data not shown) these results 
suggest that the OriP, which was introduced in the new vector, is fimctional and that 
it significantly improves the number of cells in a cell population, which expresses the 
gene of interest. 





% GFP positive cells 




29«C 


pCytTS-EGFP-Orip 


8.2% 


79% 



20 

Table 1: GFP expression with pCytTS-EGFP-OriPd 293-EBNA cells Iday 
after induction. 



25 



C. Comparison of pCytTs-EGFP, pCytTs-EGFP-OriP and pCytTs- 
EGFP-OPE in BHK cells 
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The following constructs, pCytTs-EGFP, pCytTs-EGFP-OriP and pCytTs- 
EGFP-OPE, were next examined in BHK (Baby hamster kidney) cells which are 
known to show tight regulation for the CytTs system. BHK cells were transfected 
with CytTs-EGFP, CytTs-EGFP-OriP and CytTs-EGFP-OPE using Lipofectamine 
5 2000 (BRL) according to the manufacturer's recommendation. 24h after transfection 
cells were split 1 to 3 in presence of 5]xg/ml puromycin. After 2 to 3 days all non- 
transfected cells had detached and the resistant cells were fixrther expanded under 
puromycin selection for another week. Resistant cell populations were passed to two 
plates of which one was kept at 37 °C (non-induced) and one was shifted to 29 °C for 

10 induction of replicon replication and EGFP expression. Three days after induction 
cells were harvested and analysed by flow cj^ometry. The results are shown in Table 
2. About 20% of the cell populations carrying CytTs-EGFP or CytTs-EGFP-OriP 
were EGFP positive upon induction, whereas four times more cells - up to 90% of 
the cells transfected with CytTs-EGFP-OPE did express EGFP after temperature 

1 5 induction. Thus, the inventive expression systems tumed out to be very tight since 
less than 0.5% of the uninduced population showed weak EGFP expression. 





% GFP positive cells 




29*C 


pCytTS-EGFP 


0.20 


19 


pCytTS-EGFP- 
OriP 


0.00 


21 


pCytTS-EGFP- 
OPE 


0.00 


93 



Table 2: GFP expression with pCytTS-EGFP, pCytTS-EGFP-OriP and 
20 pCytTS-EGFP- in BHK cells after 3 days of induction. 



D. Inducible expression of pCytTs-IFNp-OPE and pCytTs-SEAP-OPE in 
BHK cells 

25 



We further analyzed constructs which contain both OriP and the replication 
initiation factor EBNAl (pCytTs-IFNp-OPE) with other genes of interest m BHK 
cells. These constructs do not depend on cell lines which have an integrated copy of 
the EBNAl gene. As a gene of interest we choose IFNp, which had already been 
5 expressed with the conventional CytTs system. To test whether our new vector 
system with its episomal replication could result in high, inducible levels of 
expression on selected cell populations we transfected BHK cells with pCj^Ts-IFNfi- 
OPE (see Example 2A) using Lipofectamine 2000 (BRL) according to the 
manufacturer's recommendation. 36h after transfection cells were split 1 to 3 in the 

10 presence of Sjig/ml puromycin. After 1 to 2 days all non-transfected cells had 
detached and the resistant cells were fiirther expanded under puromycin selection for 
another 1-2 weeks. In order to assess the expression from the new vector, resistant 
cell populations were passed to two plates one of which was kept at 37°C 
(iminduced) and one was shifted to 29°C for induction of IFNp expression. Tissue 

15 culture supematants were harvested from the uninduced control plate and 3, 4, 7 and 
9 day after induction from the plate that had been shifted to 29 ""C. The IFNp 
production at the different time points was than quantitated by ELISA (Kit of 
Fujirebio Inc., Tokyo, Japan). The induced cell populations reached a maximal level 
of 1,000,000 lU/ml after 7-9 days of induction and no IFN production could be 

20 detected in cells kept at non-permissive temperature Table 3. These results show that 
our new vector system combining the CytTs system with the episomaly replicating 
EBNA system leads to high production levels in BHK cells on selected populations. 
The expression levels are at least as high as the ones obtained from a stable cell 
clone, which resulted from 4 subcloning roimds (data not shown). In addition the 

25 new system is still tightly regulated as demonstrated by up to 300,000 fold induction 
observed upon shifting the cells to 29°C. 



Days 


lU/ml 


Fold induction 


uninduced 


3 




3 


544410 


159651 


4 


717990 


210554 
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7 


988880 


289994 


9 


994140 


291537 



Table 3: IFN-P production with pCytTS-IFN-OPE in BHK cells after 
different days of induction. 

Besides IFNP production we also analysed SEAP production with our new 
5 vector system in BHK cells. Selected cell populations were generated as described 
under C using construct pCytTs-SEAP-OPE. After two weeks of selection and 
propagation the resistant cells were passed into two plates of which one was kept at 
37°C and the other was shifted to 29°C for induction. Samples were taken from the 
uninduced cell populations and from the induced cell population 3, 4, 7 and 9 days 
10 after induction. SEAP concentrations were measured in these samples using a kinetic 
ELISA for SEAP activity. The results are shown in Table 4. 



Days 


SEAP (mg/l) 


uninduced 


0 


3 


22 


4 


66 


7 


125 


9 


268 



Table 4: SEAP production wdth pCytTS-SEAP-OPE after different days of 
15 induction 

E. Comparison of the EPO production derived from pCytTs-OriP-EPO 
or with pCytTs-OPE-EPO in 293 EBNA cells 

20 We fiirther investigated whether an additional copy of the EBNA-1 gene on 

the same plasmid leads to higher expression of the gene of interest in 293 -EBNA 
cells. 293-EBNA cells were transfected either with pCytTs-OriP-EPO or with 
pCytTS-OPE-EPO (see Example 2A) using Lipofectamin 2000 (BRL) according to 
the manufacturer's recommendation. One day after transfection the cells were split 1 
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to 2.5 in the presence of 0.8 |ag/ml puromycin. A stable cell pool was obtained by 
further keeping these cells under selective pressure for one week. Resistant cell 
populations were then passed to two plates. Once the cells had attached, one plate 
was kept at 37°C and one plate was shifted to 29°C for induction. Cell culture 
5 supematants were collected from both plates 4 and 8 days after induction. The 
erythropoietin (EPO) levels in the cell culture supematants were then determined 
with an ELISA kit (R&D Systems, Minneapolis, USA). As shown in Table 5 the 
additional copy of the EBNAl gene on the plasmid leads to a 15 to 20 fold increase 
in the EPO production compared to the plasmid which only contains the OriP. In 

10 conclusion, this experiment shows that, even in cells which contain an integrated 
copy, the expression of the gene of interest can be increased by an additional copy on 
the plasmid. These results suggest that the EBNAl dosage may be a limiting factor 
for eflScient replication to occur. In these experiments the EBNAl dosage was 
increased by a copy on the OriP- containing plasmid. Similar results may be obtained 

15 if the EBNAl dosage is increased in the cells either by the integration of fiirther 
copies or if the additional copies would be provided on a separate plasmid in trans. 





EPO (mg/l) 




pCytTS-EPO-OPE 


pCytTS-EPO-OriP 


uninduced 


0 


0.02 


4 


2.7 


0.2 


8 


3.9 


0.2 



20 Table 5: Improvement of expression with additional copies of EBNAl after 4 

and 8 days of induction 



EXAMPLES 

25 

CONSTRUCTION OF VECTORS OF THE pCytTs SYSTEM CONTAINING THE 
GLUTAMINE SYNTHETASE AS SELECTABEL MARKER 
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The glutamine synthetase gene is cloned either from hamster cells (CHO-Kl, 
Chinese hamster ovary, ATCC, Cat. No. CCL-61; BHK21, Hamster Syrian kidney 
cells, DSMZ, Cat. No. ACC 61) or human cells (HEK 293 cells (ATCC, Cat. No. 
5 CRL-1573; Hela ATCC, Cat. No. CCL-2; Raji cells, ATCC, Cat. No. CCL-86; 293 
EBNA , Invitrogen, Cat. No. R62007; and 143B cells, ATCC, Cat. No. CRL-8303) 
by the method of reverse transcription. Total RNA or cj^oplasmic RNA is isolated 
from the cells using the RNeasy Kit (Qiagen, Inc., 9259 Eton Avenue, Chatsworth, 
CA, 91311) according to the manufactures recommendation. Enrichment of poly(A)'*^ 

10 RNA from total RNA can be done by using the Oligotex mRNA Kit (Qiagen, Inc., 
9259 Eton Avenue, Chatsworth, CA, 91311). The resulting RNA can be treated with 
DNasel to remove residual traces of DNA. cDNA synthesis is performed in the first 
step using the ThermoScript RT-PCR System (Invitrogen, Cat. No. 1 1 146-024) with 
either total RNA or poly(A)+ -selected RNA primed with oligo(dT) or random 

15 primers according to the manufactures recommendation. Treatment of cDNA with 
RNase H to remove the complementary RNA prior to PCR is optional. In the second 
step, PCR is performed using primers specific for the hamster or human glutamine 
synthetase. To amplify the cDNA of hamster and human glutamine synthetase the 
following oligonucleotides can be used: 

20 hamGSfor: 5'- GTCATGAAGCTTGCCACCGCTCAGAGC ACCTTC-3' 

(SEQ ID NO: 23) 

ham GSrev: 5 GAC TTCTAGA CTGGGGCGGGGTGGGATGAAC-3 ' 
(SEQ ID NO: 24) 

hGSfor: 5'- GTCATGAAGCTICTCGGCGACCAGAACACCTTC-3' (SEQ 
25 ID NO: 25) 

hGSrev: 5'- GAC TTCTAGA CTGGGGCGGGGTGGGATGAAC (SEQ ID 
NO: 26) 

Restriction sites used for cloning are underlined (Hindlll^ Xbal), The 
resulting PCR fragment is purified using Qia spin PCR kit (Qiagen, Inc., 9259 Eton 
30 Avenue, Chatsworth, CA, 91311). 

To replace the puromycin resistance marker with the glutamine synthetase 
cassette in vector pCytTS2,l, the glutamine synthetase cassette is isolated from Mlul 
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and BamHl digested vector pGL3-GS. The fragment is treated with Klenow 

polymerase in order to get blunt ends and ligated into Mlul digested vector 

pCytTS2.1 treated with Klenow polymerase. 

To combine the resulting pCytTS2.1gs vector with the self replicating EBNA 
5 system either the origin of replication alone (pGEMT-OriP) or together with the 

replication initiation factor (pGemT-OriP) can be excised by Notl digestion and 

transferred into the Notl site of pCytTS2.1gs. 

Vectors of the present invention containing the glutamine synthetase gene as 

selection marker are transfected in different cell lines as described above 
10 (EXAMPLE 2). Mammalian cell lines that do not express GS (glutamine synthetase) 

cannot survive without added glutamine in the medium. For these cell lines, a 

transfected GS gene can function as a selectable marker by permitting growth in a 

glutamine-free medium (e.g. mouse myeloma NSO cell line). For other cell lines (e.g. 

CHO-Kl) producing endogenous GS, methionine sulphoximine (MSX) has to be 
1 5 added as selective agent to the glutamine-free medium. 

EXAMPLE 4 

20 STABILITY OF EXPRESSION IN DIFFERENT CELL LINES USING THE 

pCytTS-EBNA SYSTEM 

Vectors of the pCytTS-EBNA system containing either the EBNA origin of 
replication (OriP) alone or in conjunction with the replication initiation factor 

25 EBNAl were transfected in different cell lines. 

293 EBNA cells were transfected with vector pCytTS-EPO-OriP using 
Lipofectamine 2000 (BRL) according to the manufacturer's recommendation. One 
day after transfection cells were split 1 to 4 in the presence of 0.8 \xg/ml puromycin. 
Several days after selection a stable cell pool was obtained which was further 

30 expanded. To asses inducible EPO expression, resistant cell populations were 
trypsinized and two T25 flask were seeded with three million cells each. Once the 
cells had attached one T25 flask was kept at 37 (uninduced) and one was shifted 
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to 29 °C for induction of replicon replication and EPO expression. Cell culture 
supematants were collected from both plates seven days after induction. The 
erythropoietin (EPO) level in the cell culture supematants was determined with an 
ELISA kit (R&D Systems, Minneapolis, USA). A backup plate of the resistant cell 
5 pool was kept in culture and passaged about 1 to 10 twice a week. To determine 
stability of EPO expression over several weeks, cells from the backup plate were 
seeded at intervals of one or two weeks again on T25 flask as described above. Cells 
were induced for 7 days and EPO expression was determined by ELISA (Table 6). 





EPO (mg/l) 


Weeks 


37 "C 


29 'C 


1 


<0.1 


2.3 


2 


< 0.1 


2.8 


3 


<0.1 


4.4 


4 


< 0.1 


4.9 


5 


<0.1 


4.5 



10 

Table 6: Stability of pCytTS-EPO-OriP in 293-EBNA cells between 1 and 5 
weeks after completion of selection. 

These results show that stable cell pools containing a vector system 
15 combining the pCytTS system with the episomaly replicating EBNA system allow 
expression of EPO over several weeks without loss of productivity. EPO values in 
the range of 3-5 mg/l are obtained. Fiortheron the pCytTS system is tightly regulated 
as at 37°C hardly no EPO production can be detected. 

Having now fiilly described the present invention in some detail by way of 
20 illustration and example for purposes of clarity of xmderstanding, it will be obvious 
to one of ordinary skill in the art that the same can be performed by modifying or 
changing the invention within a wide and equivalent range of conditions, 
formulations and other parameters without affecting the scope of the invention or any 
specific embodiment thereof, and that such modifications or changes are intended to 
25 be encompassed within the scope of the appended claims. 



-59- 



AU publications, patents and patent applications mentioned in this 
specification are indicative of the level of skill of those skilled in the art to which this 
invention pertains, and are herein incorporated by reference to the same extent as if 
each individual publication, patent or patent application was specifically and 
5 individually indicated to be incorporated by reference. 
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CLAIMS: 

5 1 . A nucleic acid molecule comprising: 

(A) a first polynucleotide element which encodes an RNA molecule, said 
RNA molecule comprising: 

(a) at least one c/.y-acting sequence element; 

(b) a first nucleotide sequence comprising a first open reading 
10 frame, said first open reading fi-ame having a nucleotide 

sequence encoding an RNA-dependent RNA polymerase; and 

(c) at least one second nucleotide sequence selected fi-om the 
group consisting of: 

(i) a second open reading frame encoding a polypeptide; 
15 (ii) a nucleotide sequence complementary to all or a part of 

the second open reading frame of (i); and 
(iii) a nucleotide sequence encoding an imtranslated RNA 

molecule or complement thereof; 
wherein said second nucleotide sequence is operably linked to 
20 a promoter which is recognized by said RNA-dependent RNA 

polymerase; 

(B) a second polynucleotide element comprising an origin of replication; 
and 

(C) a third polynucleotide element encoding a replication initiation factor 
25 capable of recognizing said origin of replication. 

2. The nucleic acid molecule of claim 1, wherein said origin of replication is 
derived from a prokaryotic organism, a eukaryotic organism or a virus. 



30 



The nucleic acid molecule of claim 2, wherein said origin of replication is 
derived from a eukaryotic organism selected from the group consisting of 
yeast, mammals and insects. 
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4. The nucleic acid molecule of claim 1, wherein said origin of replication is 
derived from a DNA virus that allows for episomal replication. 

5 5. The nucleic acid molecule of claim 1, wherein said origin of replication is 
derived from a DNA virus, preferably from a DNA virus selected from the 
group consisting of Papillomavirus, Polyomavirus, Adenovirus, and 
Hepadnaviruses. 

10 6. The nucleic acid molecule of claim 1, wherein said origin of replication is 
derived from a Herpesvirus, and preferably wherein said origin of replication 
is derived from Epstein-Barr virus (EBV). 

7. The nucleic acid molecule of claim 1, wherein said origin of replication is 
15 oriF. 

8. The nucleic acid molecule of any one of the preceding claims, wherein said 
replication initiation factor is capable of operating as a plasmid maintenance 
factor. 

20 

9. The nucleic acid molecule of any one of the preceding claims, wherein said 
replication initiation factor is derived from a prokaryotic organism, a 
eukaryotic organism or a virus. 

25 10. The nucleic acid molecule of any one of the preceding claims, wherein said 
replication initiation factor is derived from a DNA virus, preferably derived 
from a DNA virus selected from the group consisting of Herpesvirus, 
Papillomavirus, Polyomavirus, Adenovirus, and Hepadnavirus. 

30 11. The nucleic acid molecule of any one of the preceding claims, wherein said 
replication initiation factor is derived from a Herpesvirus, and preferably 
wherein said replication initiation factor is derived from EBV. 
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12. The nucleic acid molecule of any one of the preceding claims, wherein said 
replication initiation factor is the EBNA-1 protein or a portion thereof. 

5 13. The nucleic acid molecule of any one of the preceding claims, wherein said 
replication initiation factor and said origin of replication are derived from the 
same organism or the same virus. 

14. The nucleic acid molecule of any one of the preceding claims, wherein said 
10 replication initiation factor and said origin of replication are derived from 

different organisms or viruses. 

15. The nucleic acid molecule of any one of the preceding claims further 
comprising a fourth pol5^ucleotide element, wherein said fourth 

15 polynucleotide element comprises a selection marker. 

16. The nucleic acid molecule of claim 15, wherein said selection marker confers 
resistance to Puromycin. 

20 17. The nucleic acid molecule of any one of the preceding claims, wherein said 
second open reading frame is in a translatable format after one or more RNA- 
dependent RNA replication events. 

18. The nucleic acid molecule of any one of the preceding claims, wherein said 
25 RNA-dependent RNA polymerase is selected from the group consisting of: 

(a) a temperature-sensitive RNA-dependent RNA polymerase; (b) a non- 
cytopathic RNA-dependent RNA polymerase; and (c) a temperature- 
sensitive, non-cytopathic RNA-dependent RNA polymerase. 



30 



19. 



The nucleic acid molecule of any one of the preceding claims, wherein said 
RNA-dependent RNA polymerase is of viral origin, and preferably wherein 
said RNA-dependent RNA polymerase is of alphaviral origin. 
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20. The nucleic acid molecule of any one of the preceding claims, wherein said 
RNA-dependent RNA polymerase is derived from a Sindbis vims, a Semliki 
Forest virus or an Aura virus. 

5 

21. The nucleic acid molecule of any one of the preceding claims, v^herein said 
RNA-dependent RNA polymerase is derived from a virus selected from the 
group consisting of Bebaru virus, Cabassou virus, Chikungunya virus, Easter 
equine encephalomyelitis virus. Fort morgan virus, Getah virus, Kyzylagach 

10 virus, Mayoaro virus, Middleburg virus, Mucambo virus, Ndumu virus, 

Pixuna virus, Tonate virus, Triniti virus, Una virus. Western equine 
encephalomyelitis virus, Whataroa virus, Venezuelan equine 
encephalomyelitis virus (VEE), and Ross River virus. 

15 22. The nucleic acid molecule of claim 18, wherein said temperature-sensitive 
RNA-dependent RNA polymerase has replicase activity at temperatures 
below 34°C which is at least five fold greater than the replicase activity 
exhibited at 34°C or above. 

20 23. The nucleic acid molecule of claim 18, wherein said temperature-sensitive 
RNA-dependent RNA polymerase has replicase activity at 34°C which is at 
least five fold lower than the replicase activity exhibited at 29^C. 

24. The nucleic acid molecule of any one of the preceding claims, wherein said 
25 second open reading frame encodes a cytokine, a lymphokine, a tumor 

necrosis factor, an interferon, a toxic polypeptide, a prodrug or a converting 
enzyme. 



25. 

30 



The nucleic acid molecule of any one of the preceding claims, wherein said 
second nucleotide sequence encodes an untranslated RNA molecule selected 



-64- 



from the group consisting of an antisense RNA molecule, a tRNA molecule, a 
rRNA molecule, and a ribozyme. 

An expression system comprising: 

(A) a first polynucleotide element which encodes an RNA molecule, said 
RNA molecule comprising: 

(a) at least one c/5-acting sequence element; 

(b) a first nucleotide sequence comprising a first open reading 
frame, said first open reading frame having a nucleotide 
sequence encoding an RNA-dependent RNA polymerase; and 

(c) at least one second nucleotide sequence selected from the 
group consisting of: 

(i) a second open reading frame encoding a polypeptide; 

(ii) a nucleotide sequence complementary to all or a part of 
the second open reading frame of (i); and 

(iii) a nucleotide sequence encoding an vmtranslated RNA 
molecule or complement thereof; 

wherein said second nucleotide sequence is operably linked to 
a promoter which is recognized by said RNA-dependent RNA 
polymerase; 

(B) a second polynucleotide element comprising an origin of replication; 
and 

(C) a third polynucleotide element encoding a replication initiation factor 
capable of recognizing said origin of replication. 

The expression system of claim 26, wherein said origin of replication is 
derived from a prokaryotic organism, a eukaryotic organism or a virus. 

The expression system of claim 27, wherein said origin of replication is 
derived from an eukaryotic organism selected from the group consisting of 
yeast, insects and mammals. 
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The expression system of claim 26, wherein said origin of replication is 
derived from a DNA virus that allows for episomal replication. 

The expression system of claim 26, wherein said origin of replication is 
derived from a DNA virus, preferably from a DNA virus selected from the 
group consisting of Herpesvirus, Papillomavirus, Polyomavirus, Adenovirus, 
and Hepadnaviruses. 

The expression system of claim 26, wherein said origin of replication is 
derived from a Herpesvirus, and preferably wherein said origin of replication 
is derived from Epstein-Barr virus (EBV). 

The expression system of claim 26, wherein said origin of replication is oriF, 

The expression system of any one of the preceding claims, wherein said 
replication initiation factor is capable of operating as plasmid maintenance 
factor. 

The expression system of any one of the preceding claims, wherein said 
replication initiation factor is derived from a prokaryotic organism or a 
eukaryotic organism, or a virus. 

The expression system of any one of the preceding claims, wherein said 
replication initiation factor is derived from a DNA virus, preferably derived 
from a DNA virus selected from the group consisting of Herpesvirus, 
Papillomavims, Polyomavirus, Adenovims, and Hepadnavirus. 

The expression system of any one of the preceding claims, wherein said 
replication initiation factor is derived from a Herpesvirus, and preferably 
wherein said replication initiation factor is derived from EBV. 
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37. The expression system of any one of the preceding claims, wherein said 
replication initiation factor is the EBNA-1 protein, or portion thereof. 

38. The expression system of any one of the preceding claims, wherein said 
5 replication initiation factor and said origin of replication are derived from the 

same organism or the same virus. 

39. The expression system of any one of the preceding claims, wherein said 
replication initiation factor and said origin of replication are derived from 

1 0 different organisms. 

40. The expression system of any one of the preceding claims, further comprising 
a fourth polynucleotide element, wherein said fourth polynucleotide element 
comprises a selection marker. 

15 

41. The expression system of claim 40, wherein said selection marker confers 
resistance to Puromycin. 

42. The expression system of any one of the preceding claims, wherein said 
20 second open reading frame is in a translatable format after one or more RNA- 

dependent RNA replication events. 

43. The expression system of any one of the preceding claims, wherein said first, 
second and third polynucleotide elements are DNA elements. 

25 

44. The expression system of any one of the preceding claims, wherein said first, 
second and third polynucleotide elements are each present on a separate 
nucleic acid molecule. 

30 45. The expression system of any one of the preceding claims, wherein said first, 
second and third polynucleotide elements are all present on a single nucleic 
acid molecule. 
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46. The expression system of any one of the preceding claims, wherein said first 
and said second polynucleotide elements are present on a first nucleic acid 
molecule, and said third pol5mucleotide element is present on a second 

5 nucleic acid molecule. 

47. The expression system of any one of the preceding claims, wherein said 
second nucleic acid molecule further comprises a fourth polynucleotide 
element capable of promoting the replication of said second nucleic acid 

10 molecule. 

48. The expression system of any one of the preceding claims, wherein said 
second nucleic acid sequence further comprises an origin of replication. 

15 49. The expression system of any one of the preceding claims, wherein said 
RNA-dependent RNA polymerase is selected from the group consisting of: 
(a) a temperature-sensitive RNA-dependent RNA polymerase; (b) a non- 
cytopathic RNA-dependent RNA polymerase; and (c) a temperature- 
sensitive, non-c54opathic RNA-dependent RNA polymerase. 

20 

50. The expression system of any one of the preceding claims, wherein said 
RNA-dependent RNA polymerase is of viral origin, and preferably wherein 
said RNA-dependent RNA polymerase is of alphaviral origin. 

25 51. The expression system of any one of the preceding claims, wherein said 
RNA-dependent RNA polymerase is derived from a Sindbis vims, firom a 
Semliki Forest virus or fi-om an Aura virus. 

52. The expression system of any one of the preceding claims, wherein said 
30 RNA-dependent RNA polymerase is derived fi"om a vims selected from the 

group consisting of Bebaru vims, Cabassou virus, Chikungunya vims, Easter 
equine encephalomyelitis vims. Fort morgan virus, Getah vims, Kyzylagach 
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virus, Mayoaro virus, Middleburg virus, Mucambo virus, Ndumu virus, 
Pixuna virus, Tonate virus, Triniti virus, Una virus. Western equine 
encephalomyelitis virus, Whataroa virus, Venezuelan equine 
encephalomyelitis virus (VEE), and Ross River virus. 

The expression system of claim 49, wherein said temperature-sensitive RNA- 
dependent RNA polymerase has replicase activity at temperatures below 
34°C which is at least five fold greater than the replicase activity exhibited at 
34°C or above. 

The expression system of claim 49, wherein said temperature-sensitive RNA- 
dependent RNA polymerase has replicase activity at 34°C which is at least 
five fold lower than the replicase activity exhibited at 29®C. 

The expression system of any one of the preceding claims, wherein said 
second open reading frame encodes a cytokine, a lymphokine, a tumor 
necrosis factor, an interferon, a toxic protein, a prodrug or a converting 
enzyme. 

The expression system of any one of the preceding claims, wherein said 
second nucleotide sequence encodes an untranslated RNA molecule selected 
from the group consisting of an antisense RNA molecule, a tRNA molecule, a 
rRNA molecule, and a ribozyme. 

A method of making a recombinant host cell comprising introducing the 
nucleic acid molecule of claim 1 into a host cell. 

The method of claim 57, wherein said RNA-dependent RNA polymerase is 
selected from the group consisting of: (a) a temperature-sensitive RNA- 
dependent RNA polymerase; (b) a non-cytopathic RNA-dependent RNA 



polymerase; and (c) a temperature-sensitive, non-cytopathic RNA-dependent 
RNA polymerase. 

The method of claim 57, wherein said nucleic acid molecule is introduced by 
way of transfection. 

A method of making a recombinant host cell comprising introducing the 
expression system of claim 26 into a host cell. 

The method of claim 60, wherein said RNA-dependent RNA polymerase is 
selected from the group consisting of: (a) a temperature-sensitive RNA- 
dependent RNA polymerase; (b) a non-cytopathic RNA-dependent RNA 
polymerase; and (c) a temperature-sensitive, non-cytopathic RNA-dependent 
RNA polymerase. 

The method of claim 60, wherein said expression system is introduced by 
way of transfection. 

A recombinant host cell produced by the method of claim 57. 
A recombinant host cell produced by the method of claim 60. 
A recombinant host cell comprising: 

(A) a first polynucleotide element which encodes an RNA molecule, said 
RNA molecule comprising: 

(a) at least one cw-acting sequence element; 

(b) a first nucleotide sequence comprising a first open reading 
frame, said first open reading frame having a nucleotide 
sequence encoding an RNA-dependent RNA polymerase; and 

(c) at least one second nucleotide sequence selected from the 
group consisting of: 
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(i) a second open reading frame encoding a polj^jeptide; 

(ii) a nucleotide sequence complementary to all or a part of 
the open reading frame of (i); and 

(iii) a nucleotide sequence encoding an untranslated RNA 
5 molecule or complement thereof; 

wherein said second nucleotide sequence is operably linked to 
a promoter which is recognized by said RNA-dependent RNA 
polymerase; 

(B) a second polynucleotide element comprising an origin of replication; 
10 and 

(C) a third polynucleotide element encoding a replication initiation factor 
capable of recognizing said origin of replication. 

66. The recombinant host cell of claim 65, wherein said third polynucleotide is 
15 stably integrated into the genome of said host cell. 

67. The recombinant host cell of claim 65, wherein said first and said second 
polynucleotide elements are present on a first nucleic acid molecule, and said 
third polynucleotide element is present on a second nucleic acid molecule. 

20 

68. The recombinant host cell of claim 65, wherein said first, second and third 
polynucleotide elements are provided on the same nucleic acid molecule. 

69. The recombinant host cell of claim 65, wherein said host cell comprises more 
25 than one copy of said third polynucleotide. 

70. The recombinant host cell of claim 69, wherein at least one copy of said third 
polynucleotide is stably integrated into the genome of said host cell. 

30 71. The recombinant host cell of any one of the preceding claims, wherein said 
second open reading frame is in a translatable format after one or more RNA- 
dependent RNA replication events. 
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72. The recombinant host cell of any one of the preceding claims, wherein said 
first, second and third polynucleotide elements are DNA elements. 

5 73. The recombinant host cell of any one of the preceding claims, further 
comprising a fourth polynucleotide comprising a selection marker, wherein 
said selection marker confers resistance to Puromycin 

74. The recombinant host cell of any one of the preceding claims, wherein said 
10 second nucleic acid molecule further comprises a fifth polynucleotide 

element capable of promoting the replication of said second nucleic acid 
molecule. 

75. The recombinant host cell of any one of the preceding claims, wherein said 
15 nucleotide sequence further comprises an origin of replication. 

76. The recombinant host cell of any one of the preceding claims, wherein said 
RNA-dependent RNA polymerase is selected from the group consisting of: 
(a) a temperature-sensitive RNA-dependent RNA polymerase; (b) a non- 
20 cytopathic RNA-dependent RNA polymerase; and (c) a temperature- 
sensitive, non-cytopathic RNA-dependent RNA polymerase. 

77. The recombinant host cell of any one of the preceding claims, wherein said 
recombinant host cell is a mammalian cell, and wherein preferably said 

25 mammalian cell is selected from the group consisting of a human cell, a 

primate cell and a rodent cell 

78. A method for producing a polypeptide or untranslated RNA molecule, said 
method comprising: 

30 (a) introducing the nucleic acid molecule of claim 1 into a host cell to 

produce a recombinant host cell; and 



(b) culturing said recombinant host cell under conditions suitable for 
expression of said polypeptide or untranslated RNA molecule. 



79. The method of claim 78, further comprising recovering said polypeptide or 
imtranslated RNA molecule. 

80. A method for producing a polypeptide or vintranslated RNA molecule, said 
method comprising: 

(a) introducing the expression system of claim 26 into a host cell to 
produce a recombinant host cell; and 

(b) culturing said recombinant host cell under conditions suitable for 
expression of said polypeptide or untranslated RNA molecule. 

81. The method of claim 80, further comprising recovering said polypeptide or 
untranslated RNA molecule. 

82. A method for regulating the expression of a polypeptide or an imtranslated 
RNA molecule, said method comprising: 

(a) introducing the nucleic acid molecule of claim 18 into a host cell to 
produce a recombinant host cell; 

(b) growing said recombinant host cell under suitable culture conditions; 
and 

(c) changing the temperature of the recombinant host cell culture from: 

(i) a permissive temperature to a restrictive temperature, or 

(ii) a restrictive temperature to a permissive temperature; 
wherein said polypeptide or untranslated RNA molecule is encoded by said 

first polynucleotide. 

83. The method of claim 82, wherein said polypeptide is a polypeptide that is 
toxic to said host cell. 
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84. A method for regulating the expression of a polypeptide or an imtranslated 
RNA molecule, said method comprising: 

(a) introducing the expression system of claim 49 into a host cell to 
produce a recombinant host cell; 
5 (b) growing said recombinant host cell imder suitable culture conditions; 

and 

(c) changing the temperature of the recombinant host cell culture from: 

(i) a permissive temperature to a restrictive temperature, or 

(ii) a restrictive temperature to a permissive temperature; 

10 wherein said polypeptide or untranslated RNA molecule is encoded by said 

first polynucleotide. 

85. The method of claim 84, wherein said polypeptide is a polypeptide that is 
toxic to said host cell. 

15 
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GENE EXPRESSION SYSTEM 

ABSTRACT OF THE DISCLOSURE 

The present invention relates to compositions and methods that allow the 
production of polypeptides and untranslated RNA molecules in host cells. More 
5 specifically, the invention provides nucleic acid molecules, expression systems, 
recombinant host cells, methods and kits, which enable the production and/or 
isolation and/or purification of polypeptides and untranslated RNA molecules. The 
compositions and methods of the invention can be applied in a regulatable expression 
system that is transiently transfected into mammalian host cells. 
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Abst:rac^ 

A probe assembly for an infrared medical thermometer without the 
necessity to put a plastic film probe cover onto the probe. 

The probe assembly includes a plastic probe body (1) , an 
elongated metal made hollow heat sink {2), which holds an 
infrared sensor unit (4) and conducts the heat increase of the 
sensor unit (4) during the temperature measurement, a hollow 
plastic cap (3) tightly fixed between the infrared sensor unit 
(4), metal heat sink (1), and the inner wall (11) of the probe 
body (1) . Like this, the probe assembly is waterproof and/or 
airtight, thereby offering an washable probe which can be 
disinfected after a performed measurement. Therefore, the 
concern of cross-contamination between patients is reduced. 



(Figure la) 
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cDNA 












<400> 1 
ctgacgcgcc 


ctgtagcggc 


gcattaagcg 


cggcgggtgt 


ggtggttacg 


cgcagcgtga 


60 


ccgctacact 


tgccagcgcc 


ctagcgcccg 


ctcctttcgc 


tttcttccct 


tcctttctcg 


120 


ccacgttcgc 


cggctttccc 


cgtcaagctc 


taaatcgggg 


gctcccttta 


gggttccgat 


180 


ttagtgcttt 


acggcacctc 


gaccccaaaa 


aacttgatta 


gggtgatggt 


tcacgtagtg 


240 


ggccatcgcc 


ctgatagacg 


gtttttcgcc 


ctttgacgtt 


ggagtccacg 


ttctttaata 


300 


gtggactctt 


gttccaaact 


ggaacaacac 


tcaaccctat 


ctcggtctat 


tcttttgatt 


360 


tataagggat 


tttgccgatt 


tcggcctatt 


ggttaaaaaa 


tgagctgatt 


taacaaaaat 


420 


ttaacgcgaa 


ttttaacaaa 


atattaacgc 


ttacaatttc 


cattcgccat 


tcaggctgcg 


480 


caactgttgg 


gaagggcgat 


cggtgcgggc 


ctcttcgcta 


ttacgccagc 


tggcgaaagg 


540 


gggatgtgct 


gcaaggcgat 


taagttgggt 


aacgccaggg 


ttttcccagt 


cacgacgttg 


600 


taaaacgacg 


gccagtgagc 


gcgcaattaa 


ccctcactaa 


agggaacaaa 


agctggctag 


660 


tggatccagt 


cttatgcaat 


actcttgtag 


tcttgcaaca 


tggtaacgat 


gagttagcaa 


720 


catgccttac 


aaggagagaa 


aaagcaccgt 


gcatgccgat 


tggtggaagt 


aaggtggtac 


780 


gatcgtgcct 


tattaggaag 


gcaacagacg 


ggtctgacat 


ggattggacg 


aaccactgaa 


840 


ttccgcattg 


cagagatatt 


gtatttaagt 


gccctacctc 


gataccgtcg 


agattgacgg 


900 


cgtagtacac 


actattgaat 


caaacagccg 


accaattgca 


ctaccatcac 


aatggagaag 


960 


ccagtagtaa 


acgtagacgt 


agacccccag 


agtccgtttg 


tcgtgcaact 


gcaaaaaagc 


1020 



2 



ttcccgcaat 


ttgaggtagt 


agcacagcag 


gtcactccaa 


atgaccatgc 


taatgccaga 


1080 


gcattttcgc 


atctggccag 


taaactaatc 


gagctggagg 


ttcctaccac 


agcgacgatc 


1140 


ttggacatag 


gcagcgcacc 


ggctcgtaga 


atgttttccg 


agcaccagta 


tcattgtgtc 


1200 


tgccccatgc 


gtagtccaga 


agacccggac 


cgcatgatga 


aatacgccag 


taaactggcg 


1260 


gaaaaagcgt 


gcaagattac 


aaacaagaac 


ttgcatgaga 


agattaagga 


tctccggacc 


1320 


gtacttgata 


cgccggatgc 


tgaaacacca 


tcgctctgct 


ttcacaacga 


tgttacctgc 


1380 


aacatgcgtg 


ccgaatattc 


cgtcatgcag 


gacgtgtata 


tcaacgctcc 


cggaactatc 


1440 


tatcatcagg 


ctatgaaagg 


cgtgcggacc 


ctgtactgga 


ttggcttcga 


caccacccag 


1500 


ttcatgttct 


cggctatggc 


aggttcgtac 


cctgcgtaca 


acaccaactg 


ggccgacgag 


1560 


aaagtccttg 


aagcgcgtaa 


catcggactt 


tgcagcacaa 


agctgagtga 


aggtaggaca 


1620 


ggaaaattgt 


cgataatgag 


gaagaaggag 


ttgaagcccg 


ggtcgcgggt 


ttatttctcc 


1680 


gtaggatcga 


cactttatcc 


agaacacaga 


gccagcttgc 


agagctggca 


tcttccatcg 


1740 


gtgttccact 


tgaatggaaa 


gcagtcgtac 


acttgccgct 


gtgatacagt 


ggtgagttgc 


1800 


gaaggctacg 


tagtgaagaa 


aatcaccatc 


agtcccggga 


tcacgggaga 


aaccgtggga 


1860 


tacgcggtta 


cacacaatag 


cgagggcttc 


ttgctatgca 


aagttactga 


cacagtaaaa 


1920 


ggagaacggg 


tatcgttccc 


tgtgtgcacg 


tacatcccgg 


ccaccatatg 


cgatcagatg 


1980 


actggtataa 


tggccacgga 


tatatcacct 


gacgatgcac 


aaaaacttct 


ggttgggctc 


2040 


aaccagcgaa 


ttgtcattaa 


cggtaggact 


aacaggaaca 


ccaacaccat 


gcaaaattac 


2100 


cttctgccga 


tcatagcaca 


agggttcagc 


aaatgggcta 


aggagcgcaa 


ggatgatctt 


2160 


gataacgaga 


aaatgctggg 


tactagagaa 


cgcaagctta 


cgtatggctg 


cttgtgggcg 


2220 


tttcgcacta 


agaaagtaca 


ttcgttttat 


cgcccacctg 


gaacgcagac 


ctgcgtaaaa 


2280 


gtcccagcct 


cttttagcgc 


ttttcccatg 


tcgtccgtat 


ggacgacctc 


tttgcccatg 


2340 


tcgctgaggc 


agaaattgaa 


actggcattg 


caaccaaaga 


aggaggaaaa 


actgctgcag 


2400 


gtctcggagg 


aattagtcat 


ggaggccaag 


gctgcttttg 


aggatgctca 


ggaggaagcc 


24 60 


agagcggaga 


agctccgaga 


agcacttcca 


ccattagtgg 


cagacaaagg 


catcgaggca 


2520 


gccgcagaag 


ttgtctgcga 


agtggagggg 


ctccaggcgg 


acatcggagc 


agcattagtt 


2580 


gaaaccccgc 


gcggtcacgt 


aaggataata 


cctcaagcaa 


atgaccgtat 


gatcggacag 


264 0 


tatatcgttg 


tctcgccaaa 


ctctgtgctg 


aagaatgcca 


aactcgcacc 


agcgcacccg 


2700 


ctagcagatc 


aggttaagat 


cataacacac 


tccggaagat 


caggaaggta 


cgcggtcgaa 


2760 


ccatacgacg 


ctaaagtact 


gatgccagca 


ggaggtgccg 


taccatggcc 


agaattccta 


2820 


gcactgagtg 


agagcgccac 


gttagtgtac 


aacgaaagag 


agtttgtgaa 


ccgcaaacta 


2880 
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taccacattg 


ccatgcatgg 


ccccgccaag 


aatacagaag 


aggagcagta 


caaggttaca 


2940 


aaggcagagc 


ttgcagaaac 


agagtacgtg 


tttgacgtgg 


acaagaagcg 


ttgcgttaag 


3000 


aaggaagaag 


cctcaggtct 


ggtcctctcg 


ggagaactga 


ccaaccctcc 


ctatcatgag 


3060 


ctagctctgg 


agggactgaa 


gacccgacct 


gcggtcccgt 


acaaggtcga 


aacaatagga 


3120 


gtgataggca 


caccggggtc 


gggcaagtca 


gctattatca 


agtcaactgt 


cacggcacga 


3180 


gatcttgtta 


ccagcggaaa 


gaaagaaaat 


tgtcgcgaaa 


ttgaggccga 


cgtgctaaga 


3240 


ctgaggggta 


tgcagattac 


gtcgaagaca 


gtagattcgg 


ttatgctcaa 


cggatgccac 


3300 


aaagccgtag 


aagtgctgta 


cgttgacgaa 


gcgttcgcgt 


gccacgcagg 


agcactactt 


3360 


gccttgattg 


ctatcgtcag 


gccccgcaag 


aaggtagtac 


tatgcggaga 


ccccatgcaa 


3420 


tgcggattct 


tcaacatgat 


gcaactaaag 


gtacatttca 


atcaccctga 


aaaagacata 


3480 


tgcaccaaga 


cattctacaa 


gtatatctcc 


cggcgttgca 


cacagccagt 


tacagctatt 


3540 


gtatcgacac 


tgcattacga 


tggaaagatg 


aaaaccacga 


acccgtgcaa 


gaagaacatt 


3600 


gaaatcgata 


ttacaggggc 


cacaaagccg 


aagccagggg 


atatcatcct 


gacatgtttc 


3660 


cgcgggtggg 


ttaagcaatt 


gcaaatcgac 


tatcccggac 


atgaagtaat 


gacagccgcg 


3720 


gcctcacaag 


ggctaaccag 


aaaaggagtg 


tatgccgtcc 


ggcaaaaagt 


caatgaaaac 


3780 


ccactgtacg 


cgatcacatc 


agagcatgtg 


aacgtgttgc 


tcacccgcac 


tgaggacagg 


3840 


ctagtgtgga 


aaaccttgca 


gggcgaccca 


tggattaagc 


agcccactaa 


catacctaaa 


3900 


ggaaactttc 


aggctactat 


agaggactgg 


gaagctgaac 


acaagggaat 


aattgctgca 


3960 


ataaacagcc 


ccactccccg 


tgccaatccg 


ttcagctgca 


agaccaacgt 


ttgctgggcg 


4020 


aaagcattgg 


aaccgatact 


agccacggcc 


ggtatcgtac 


ttaccggttg 


ccagtggagc 


4080 


gaactgttcc 


cacagtttgc 


ggatgacaaa 


ccacattcgg 


ccatttacgc 


cttagacgta 


4140 


atttgcatta 


agtttttcgg 


catggacttg 


acaagcggac 


tgttttctaa 


acagagcatc 


4200 


ccactaacgt 


accatcccgc 


cgattcagcg 


aggccggtag 


ctcattggga 


caacagccca 


4260 


ggaacccgca 


agtatgggta 


cgatcacgcc 


attgccgccg 


aactctcccg 


tagatttccg 


4320 


gtgttccagc 


tagctgggaa 


gggcacacaa 


cttgatttgc 


agacggggag 


aaccagagtt 


4380 


atctctgcac 


agcataacct 


ggtcccggtg 


aaccgcaatc 


ttcctcacgc 


cttagtcccc 


4440 


gagtacaagg 


agaagcaacc 


cggcccggtc 


aaaaaattct 


tgaaccagtt 


caaacaccac 


4500 


tcagtacttg 


tggtatcaga 


ggaaaaaatt 


gaagctcccc 


gtaagagaat 


cgaatggatc 


4560 


gccccgattg 


gcatagccgg 


tgcagataag 


aactacaacc 


tggctttcgg 


gtttccgccg 


4620 


caggcacggt 


acgacctggt 


gttcatcaac 


attggaacta 


aatacagaaa 


ccaccacttt 


4680 


cagcagtgcg 


aagaccatgc 


ggcgacctta 


aaaacccttt 


cgcgttcggc 


cctgaattgt 


4740 


ttaaactcag 


gaggcaccct 


cgtggtgaag 


tcctatggct 


acgccgaccg 


caacagtgag 


4800 
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gacgtagtca 


ccgctcttgc 


cagaaagttt 


gtcagggtgt 


ctgcagcgag 


accagattgt 


4 8 60 


gtctcaagca 


atacagaaat 


gtacctgatt 


ttccgacaac 


tagacaacag 


ccgtacacgg 


4 920 


caattcaccc 


cgcaccatct 


gaattgcgtg 


atttcgtccg 


tgtatgaggg 


tacaagagat 


4 980 


ggagttggag 


ccgcgccgtc 


ataccgcacc 


aaaagggaga 


atattgctga 


ctgtcaagag 


5040 


gaagcagttg 


tcaacgcagc 


caatccgctg 


ggtagaccag 


gcgaaggagt 


ctgccgtgcc 


5100 


atctataaac 


gttggccgac 


cagttttacc 


gattcagcca 


cggagacagg 


caccgcaaga 


5160 


atgactgtgt 


gcctaggaaa 


gaaagtgatc 


cacgcggtcg 


gccctgattt 


ccggaagcac 


5220 


ccagaagcag 


aagccttgaa 


attgctacaa 


aacgcctacc 


atgcagtggc 


agacttagta 


5280 


aatgaacata 


acatcaagtc 


tgtcgccatt 


ccactgctat 


ctacaggcat 


ttacgcagcc 


5340 


ggaaaagacc 


gccttgaagt 


atcacttaac 


tgcttgacaa 


ccgcgctaga 


cagaactgac 


5400 


gcggacgtaa 


ccatctattg 


cctggataag 


aagtggaagg 


aaagaatcga 


cgcggcactc 


5460 


caacttaagg 


agtctgtaac 


agagctgaag 


gatgaagata 


tggagatcga 


cgatgagtta 


5520 


gtatggattc 


atccagacag 


ttgcttgaag 


ggaagaaagg 


gattcagtac 


tacaaaagga 


5580 


aaattgtatt 


cgtacttcga 


aggcaccaaa 


ttccatcaag 


cagcaaaaga 


catggcggag 


5640 


ataaaggtcc 


tgttccctaa 


tgaccaggaa 


agtaatgaac 


aactgtgtgc 


ctacatattg 


5700 


ggtgagacca 


tggaagcaat 


ccgcgaaaag 


tgcccggtcg 


accataaccc 


gtcgtctagc 


5760 


ccgcccaaaa 


cgttgccgtg 


cctttgcatg 


tatgccatga 


cgccagaaag 


ggtccacaga 


5820 


cttagaagca 


ataacgtcaa 


agaagttaca 


gtatgctcct 


ccacccccct 


tcctaagcac 


5880 


aaaattaaga 


atgttcagaa 


ggttcagtgc 


acgaaagtag 


tcctgtttaa 


tccgcacact 


5940 


cccgcattcg 


ttcccgcccg 


taagtacata 


gaagtgccag 


aacagcctac 


cgctcctcct 


6000 


gcacaggccg 


aggaggcccc 


cgaagttgta 


gcgacaccgt 


caccatctac 


agctgataac 


6060 


acctcgcttg 


atgtcacaga 


catctcactg 


gatatggatg 


acagtagcga 


aggctcactt 


6120 


ttttcgagct 


ttagcggatc 


ggacaactct 


attactagta 


tggacagttg 


gtcgtcagga 


6180 


cctagttcac 


tagagatagt 


agaccgaagg 


caggtggtgg 


tggctgacgt 


tcatgccgtc 


6240 


caagagcctg 


cccctattcc 


accgccaagg 


ctaaagaaga 


tggcccgcct 


ggcagcggca 


6300 


agaaaagagc 


ccactccacc 


ggcaagcaat 


agctctgagt 


ccctccacct 


ctcttttggt 


6360 


ggggtatcca 


tgtccctcgg 


atcaattttc 


gacggagaga 


cggcccgcca 


ggcagcggta 


6420 


caacccctgg 


caacaggccc 


cacggatgtg 


cctatgtctt 


tcggatcgtt 


ttccgacgga 


6480 


gagattgatg 


agctgagccg 


cagagtaact 


gagtccgaac 


ccgtcctgtt 


tggatcattt 


6540 


gaaccgggcg 


aagtgaactc 


aattatatcg 


tcccgatcag 


ccgtatcttt 


tccactacgc 


6600 


aagcagagac 


gtagacgcag 


gagcaggagg 


actgaatact 


gactaaccgg 


ggtaggtggg 


6660 
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tacatatttt 


cgacggacac 


aggccctggg 


cacttgcaaa 


agaagtccgt 


tctgcagaac 


6720 


cagcttacag 


aaccgacctt 


ggagcgcaat 


gtcctggaaa 


gaattcatgc 


cccggtgctc 


6780 


gacacgtcga 


aagaggaaca 


actcaaactc 


aggtaccaga 


tgatgcccac 


cgaagccaac 


6840 


aaaagtaggt 


accagtctcg 


taaagtagaa 


aatcagaaag 


ccataaccac 


tgagcgacta 


6900 


ctgtcaggac 


tacgactgta 


taactctgcc 


acagatcagc 


cagaatgcta 


taagatcacc 


6960 


tatccgaaac 


cattgtactc 


cagtagcgta 


ccggcgaact 


actccgatcc 


acagttcgct 


7020 


gtagctgtct 


gtaacaacta 


tctgcatgag 


aactatccga 


cagtagcatc 


ttatcagatt 


7080 


actgacgagt 


acgatgctta 


cttggatatg 


gtagacgaga 


cagtcgcatg 


cctggatact 


7140 


gcaaccttct 


gccccgctaa 


gcttagaagt 


tacccgaaaa 


aacatgagta 


tagagccccg 


7200 


aatatccgca 


gtgcggttcc 


atcagcgatg 


cagaacacgc 


tacaaaatgt 


gctcattgcc 


7260 


gcaactaaaa 


gaaattgcaa 


cgtcacgcag 


atgcgtgaac 


tgccaacact 


ggactcagcg 


7320 


acattcaatg 


tcgaatgctt 


tcgaaaatat 


gcatgtaatg 


acgagtattg 


ggaggagttc 


7380 


gctcggaagc 


caattaggat 


taccactgag 


tttgtcaccg 


catatgtagc 


tagactgaaa 


7440 


ggccctaagg 


ccgccgcact 


atttgcaaag 


acgtataatt 


tggtcccatt 


gcaagaagtg 


7500 


cctatggata 


gattcgtcat 


ggacatgaaa 


agagacgtga 


aagttacacc 


aggcacgaaa 


7560 


cacacagaag 


aaagaccgaa 


agtacaagtg 


atacaagccg 


cagaacccct 


ggcgactgct 


7620 


tacttatgcg 


ggattcaccg 


ggaattagtg 


cgtaggctta 


cggccgtctt 


gcttccaaac 


7680 


attcacacgc 


tttttgacat 


gtcggcggag 


gattttgatg 


caatcatagc 


agaacacttc 


7740 


aagcaaggcg 


acccggtact 


ggagacggat 


atcgcatcat 


tcgacaaaag 


ccaagacgac 


7800 


gctatggcgt 


taaccggtct 


gatgatcttg 


gaggacctgg 


gtgtggatca 


accactactc 


7860 


gacttgatcg 


agtgcgcctt 


tggagaaata 


tcatccaccc 


atctacctac 


gggtactcgt 


7920 


tttaaattcg 


gggcgatgat 


gaaatccgga 


atgttcctca 


cactttttgt 


caacacagtt 


7980 


ttgaatgtcg 


ttatcgccag 


cagagtacta 


gaagagcggc 


ttaaaacgtc 


cagatgtgca 


8040 


gcgttcattg 


gcgacgacaa 


catcatacat 


ggagtagtat 


ctgacaaaga 


aatggctgag 


8100 


aggtgcgcca 


cctggctcaa 


catggaggtt 


aagatcatcg 


acgcagtcat 


cggtgagaga 


8160 


ccaccttact 


tctgcggcgg 


atttatcttg 


caagattcgg 


ttacttccac 


agcgtgccgc 


8220 


gtggcggatc 


ccctgaaaag 


gctgtttaag 


ttgggtaaac 


cgctcccagc 


cgacgacgag 


8280 


caagacgaag 


acagaagacg 


cgctctgcta 


gatgaaacaa 


aggcgtggtt 


tagagtaggt 


8340 


ataacaggca 


ctttagcagt 


ggccgtgacg 


acccggtatg 


aggtagacaa 


tattacacct 


8400 


gtcctactgg 


cattgagaac 


ttttgcccag 


agcaaaagag 


cattccaagc 


catcagaggg 


8460 


gaaataaagc 


atctctacgg 


tggtcctaaa 


tagtcagcat 


agtacatttc 


atctgactaa 


8520 


tactacaaca 


ccaccacctc 


tagacgcgta 


gatctcacgt 


gagcatgcag 


gccttgggcc 


8580 
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caatgatccg 


accagcaaaa 


ctcgatgtac 


ttccgaggaa 


ctgatgtgca 


taatgcatca 


8640 


ggctggtaca 


ttagatcccc 


gcttaccgcg 


ggcaatatag 


caacactaaa 


aactcgatgt 


8700 


acttccgagg 


aagcgcagtg 


cataatgctg 


cgcagtgttg 


ccacataacc 


actatattaa 


8760 


ccatttatct 


agcggacgcc 


aaaaactcaa 


tgtatttctg 


aggaagcgtg 


gtgcataatg 


8820 


ccacgcagcg 


tctgcataac 


ttttattatt 


tcttttatta 


atcaacaaaa 


ttttgttttt 


8880 


aacatttcaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaagggaa 


ttcccaactt 


8940 


gtttattgca 


gcttataatg 


gttacaaata 


aagcaatagc 


atcacaaatt 


tcacaaataa 


9000 


agcatttttt 


tcactgcatt 


ctagttgtgg 


tttgtccaaa 


ctcatcaatg 


tatcttatca 


9060 


tgtctggatc 


cgtcgagacg 


cgtccaattc 


gccctatagt 


gagtcgtatt 


acgcgcgctt 


9120 


ggcgtaatca 


tggtcatagc 


tgtttcctgt 


gtgaaattgt 


tatccgctca 


caattccaca 


9180 


caacatacga 


gccggaagca 


taaagtgtaa 


agcctggggt 


gcctaatgag 


tgagctaact 


9240 


cacattaatt 


gcgttgcgct 


cactgcccgc 


tttccagtcg 


ggaaacctgt 


cgtgccagct 


9300 


gcattaatga 


atcggccaac 


gcgcggggag 


aggcggtttg 


cgtattgggc 


gctcttccgc 


9360 


ttcctcgctc 


actgactcgc 


tgcgctcggt 


cgttcggctg 


cggcgagcgg 


tatcagctca 


9420 


ctcaaaggcg 


gtaatacggt 


tatccacaga 


atcaggggat 


aacgcaggaa 


agaacatgtg 


9480 


agcaaaaggc 


cagcaaaagg 


ccaggaaccg 


taaaaaggcc 


gcgttgctgg 


cgtttttcca 


9540 


taggctccgc 


ccccctgacg 


agcatcacaa 


aaatcgacgc 


tcaagtcaga 


ggtggcgaaa 


9600 


cccgacagga 


ctataaagat 


accaggcgtt 


tccccctgga 


agctccctcg 


tgcgctctcc 


9660 


tgttccgacc 


ctgccgctta 


ccggatacct 


gtccgccttt 


ctcccttcgg 


gaagcgtggc 


9720 


gctttctcaa 


tgctcacgct 


gtaggtatct 


cagttcggtg 


taggtcgttc 


gctccaagct 


9780 


gggctgtgtg 


cacgaacccc 


ccgttcagcc 


cgaccgctgc 


gccttatccg 


gtaactatcg 


9840 


tcttgagtcc 


aacccggtaa 


gacacgactt 


atcgccactg 


gcagcagcca 


ctggtaacag 


9900 


gattagcaga 


gcgaggtatg 


taggcggtgc 


tacagagttc 


ttgaagtggt 


ggcctaacta 


9960 


cggctacact 


agaaggacag 


tatttggtat 


ctgcgctctg 


ctgaagccag 


ttaccttcgg 


10020 


aaaaagagtt 


ggtagctctt 


gatccggcaa 


acaaaccacc 


gctggtagcg 


gtggtttttt 


10080 


tgtttgcaag 


cagcagatta 


cgcgcagaaa 


aaaaggatct 


caagaagatc 


ctttgatctt 


10140 


ttctacgggg 


tctgacgctc 


agtggaacga 


aaactcacgt 


taagggattt 


tggtcatgag 


10200 


attatcaaaa 


aggatcttca 


cctagatcct 


tttaaattaa 


aaatgaagtt 


ttaaatcaat 


10260 


ctaaagtata 


tatgagtaaa 


cttggtctga 


cagttaccaa 


tgcttaatca 


gtgaggcacc 


10320 


tatctcagcg 


atctgtctat 


ttcgttcatc 


catagttgcc 


tgactccccg 


tcgtgtagat 


10380 


aactacgata 


cgggagggct 


taccatctgg 


ccccagtgct 


gcaatgatac 


cgcgagaccc 


10440 
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acgctcaccg 


gctccagatt 


tatcagcaat 


aaaccagcca 


gccggaaggg 


ccgagcgcag 


10500 


aagtggtcct 


gcaactttat 


ccgcctccat 


ccagtctatt 


aattgttgcc 


gggaagctag 


10560 


agtaagtagt 


tcgccagtta 


atagtttgcg 


caacgttgtt 


gccattgcta 


caggcatcgt 


10620 


ggtgtcacgc 


tcgtcgtttg 


gtatggcttc 


attcagctcc 


ggttcccaac 


gatcaaggcg 


10680 


agttacatga 


tcccccatgt 


tgtgcaaaaa 


agcggttagc 


tccttcggtc 


ctccgatcgt 


10740 


tgtcagaagt 


aagttggccg 


cagtgttatc 


actcatggtt 


atggcagcac 


tgcataattc 


10800 


tcttactgtc 


atgccatccg 


taagatgctt 


ttctgtgact 


ggtgagtact 


caaccaagtc 


10860 


attctgagaa 


tagtgtatgc 


ggcgaccgag 


ttgctcttgc 


ccggcgtcaa 


tacgggataa 


10920 


taccgcgcca 


catagcagaa 


ctttaaaagt 


gctcatcatt 


ggaaaacgtt 


cttcggggcg 


10980 


aaaactctca 


aggatcttac 


cgctgttgag 


atccagttcg 


atgtaaccca 


ctcgtgcacc 


11040 


caactgatct 


tcagcatctt 


ttactttcac 


cagcgtttct 


gggtgagcaa 


aaacaggaag 


11100 


gcaaaatgcc 


gcaaaaaagg 


gaataagggc 


gacacggaaa 


tgttgaatac 


tcatactctt 


11160 


cctttttcaa 


tattattgaa 


gcatttatca 


gggttattgt 


ctcatgagcg 


gatacatatt 


11220 


tgaatgtatt 


tagaaaaata 


aacaaatagg 


ggttccgcgc 


acatttcccc 


gaaaagtgcc 


11280 


ac 
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<210> 2 

<211> 13068 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> cDNA 

<400> 2 



ctgacgcgcc 


ctgtagcggc 


gcattaagcg 


cggcgggtgt 


ggtggttacg 


cgcagcgtga 


60 


ccgctacact 


tgccagcgcc 


ctagcgcccg 


ctcctttcgc 


tttcttccct 


tcctttctcg 


120 


ccacgttcgc 


cggcggatcc 


ggccattagc 


catattattc 


attggttata 


tagcataaat 


180 


caatattggc 


tattggccat 


tgcatacgtt 


gtatccatat 


cataatatgt 


acatttatat 


240 


tggctcatgt 


ccaacattac 


cgccatgttg 


acattgatta 


ttgactagtt 


attaatagta 


300 


atcaattacg 


gggtcattag 


ttcatagccc 


atatatggag 


ttccgcgtta 


cataacttac 


360 


ggtaaatggc 


ccgcctggct 


gaccgcccaa 


cgacccccgc 


ccattgacgt 


caataatgac 


420 


gtatgttccc 


atagtaacgc 


caatagggac 


tttccattga 


cgtcaatggg 


tggagtattt 


480 


acggtaaact 


gcccacttgg 


cagtacatca 


agtgtatcat 


atgccaagta 


cgccccctat 


540 


tgacgtcaat 


gacggtaaat 


ggcccgcctg 


gcattatgcc 


cagtacatga 


ccttatggga 


600 


ctttcctact 


tggcagtaca 


tctacgtatt 


agtcatcgct 


attaccatgg 


tgatgcggtt 


660 


ttggcagtac 


atcaatgggc 


gtggatagcg 


gtttgactca 


cggggatttc 


caagtctcca 


720 
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ccccattgac 


gtcaatggga 


gtttgttttg 


gcaccaaaat 


caacgggact 


ttccaaaatg 


780 


tcgtaacaac 


tccgccccat 


tgacgcaaat 


gggcggtagg 


cgtgtacggt 


gggaggtcta 


840 


tataagcaga 


gctcgtttag 


tgaaccgtat 


tgacggcgta 


gtacacacta 


ttgaatcaaa 


900 


cagccgacca 


attgcactac 


catcacaatg 


gagaagccag 


tagtaaacgt 


agacgtagac 


960 


ccccagagtc 


cgtttgtcgt 


gcaactgcaa 


aaaagcttcc 


cgcaatttga 


ggtagtagca 


1020 


cagcaggtca 


ctccaaatga 


ccatgctaat 


gccagagcat 


tttcgcatct 


ggccagtaaa 


1080 


ctaatcgagc 


tggaggttcc 


taccacagcg 


acgatcttgg 


acataggcag 


cgcaccggct 


1140 


cgtagaatgt 


tttccgagca 


ccagtatcat 


tgtgt ctgcc 


ccatgcgtag 


tccagaagac 


1200 


ccggaccgca 


tgatgaaata 


tgccagtaaa 


ctggcggaaa 


aagcgtgcaa 


gattacaaac 


1260 


aagaacttgc 


atgagaagat 


taaggatctc 


cggaccgtac 


ttgatacgcc 


ggatgctgaa 


1320 


acaccatcgc 


tctgctttca 


caacgatgtt 


acctgcaaca 


tgcgtgccga 


atattccgtc 


1380 


atgcaggacg 


tgtatatcaa 


cgctcccgga 


actatctatc 


atcaggctat 


gaaaggcgtg 


1440 


cggaccctgt 


actggattgg 


cttcgacacc 


acccagttca 


tgttctcggc 


tatggcaggt 


1500 


tcgtaccctg 


cgtacaacac 


caactgggcc 


gacgagaaag 


tccttgaagc 


gcgtaacatc 


1560 


ggactttgca 


gcacaaagct 


gagtgaaggt 


aggacaggaa 


aattgtcgat 


aatgaggaag 


1620 


aaggagttga 


agcccgggtc 


gcgggtttat 


ttctccgtag 


gatcgacact 


ttatccagaa 


1680 


cacagagcca 


gcttgcagag 


ctggcatctt 


ccatcggtgt 


tccacttgaa 


tggaaagcag 


1740 


tcgtacactt 


gccgctgtga 


tacagtggtg 


agttgcgaag 


gctacgtagt 


gaagaaaatc 


1800 


accatcagtc 


ccgggatcac 


gggagaaacc 


gtgggatacg 


cggttacaca 


caatagcgag 


18 60 


ggcttcttgc 


tatgcaaagt 


tactgacaca 


gtaaaaggag 


aacgggtatc 


gttccctgtg 


1920 


tgcacgtaca 


tcccggccac 


catatgcgat 


cagatgactg 


gtataatggc 


cacggatata 


1980 


tcacctgacg 


atgcacaaaa 


acttctggtt 


gggctcaacc 


agcgaattgt 


cattaacggt 


2040 


aggactaaca 


ggaacaccaa 


caccatgcaa 


aattaccttc 


tgccgatcat 


agcacaaggg 


2100 


ttcagcaaat 


gggctaagga 


gcgcaaggat 


gatcttgata 


acgagaaaat 


gctgggtact 


2160 


agagaacgca 


agcttacgta 


tggctgcttg 


tgggcgtttc 


gcactaagaa 


agtacattcg 


2220 


ttttatcgcc 


cacctggaac 


gcagacctgc 


gtaaaagtcc 


cagcctcttt 


tagcgctttt 


2280 


cccatgtcgt 


ccgtatggac 


gacctctttg 


cccatgtcgc 


tgaggcagaa 


attgaaactg 


2340 


gcattgcaac 


caaagaagga 


ggaaaaactg 


ctgcaggtct 


cggaggaatt 


agtcatggag 


2400 


gccaaggctg 


cttttgagga 


tgctcaggag 


gaagccagag 


cggagaagct 


ccgagaagca 


2460 


cttccaccat 


tagtggcaga 


caaaggcatc 


gaggcagccg 


cagaagttgt 


ctgcgaagtg 


2520 


gaggggctcc 


aggcggacat 


cggagcagca 


ttagttgaaa 


ccccgcgcgg 


tcacgtaagg 


2580 
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ataatacctc 


aagcaaatga 


ccgtatgatc 


ggacagtata 


tcgttgtctc 


gccaaactct 


2640 


gtgctgaaga 


atgccaaact 


cgcaccagcg 


cacccgctag 


cagatcaggt 


taagatcata 


2700 


acacactccg 


gaagatcagg 


aaggtacgcg 


gtcgaaccat 


acgacgctaa 


agtactgatg 


2760 


ccagcaggag 


gtgccgtacc 


atggccagaa 


ttcctagcac 


tgagtgagag 


cgccacgtta 


2820 


gtgtacaacg 


aaagagagtt 


tgtgaaccgc 


aaactatacc 


acattgccat 


gcatggcccc 


2880 


gccaagaata 


cagaagagga 


gcagtacaag 


gttacaaagg 


cagagcttgc 


agaaacagag 


2940 


tacgtgtttg 


acgtggacaa 


gaagcgttgc 


gttaagaagg 


aagaagcctc 


aggtctggtc 


3000 


ctctcgggag 


aactgaccaa 


ccctccctat 


catgagctag 


ctctggaggg 


actgaagacc 


3060 


cgacctgcgg 


tcccgtacaa 


ggtcgaaaca 


ataggagtga 


taggcacacc 


ggggtcgggc 


3120 


aagtcagcta 


ttatcaagtc 


aactgtcacg 


gcacgagatc 


ttgttaccag 


cggaaagaaa 


3180 


gaaaattgtc 


gcgaaattga 


ggccgacgtg 


ctaagactga 


ggggtatgca 


gattacgtcg 


3240 


aagacagtag 


attcggttat 


gctcaacgga 


tgccacaaag 


ccgtagaagt 


gctgtacgtt 


3300 


gacgaagcgt 


tcgcgtgcca 


cgcaggagca 


ctacttgcct 


tgattgctat 


cgtcaggccc 


3360 


cgcaagaagg 


tagtactatg 


cggagacccc 


atgcaatgcg 


gattcttcaa 


catgatgcaa 


3420 


ctaaaggtac 


atttcaatca 


ccctgaaaaa 


gacatatgca 


ccaagacatt 


ctacaagtat 


3480 


atctcccggc 


gttgcacaca 


gccagttaca 


gctattgtat 


cgacactgca 


ttacgatgga 


3540 


aagatgaaaa 


ccacgaaccc 


gtgcaagaag 


aacattgaaa 


tcgatattac 


aggggccaca 


3600 


aagccgaagc 


caggggatat 


catcctgaca 


tgtttccgcg 


ggtgggttaa 


gcaattgcaa 


3660 


atcgactatc 


ccggacatga 


agtaatgaca 


gccgcggcct 


cacaagggct 


aaccagaaaa 


3720 


ggagtgtatg 


ccgtccggca 


aaaagtcaat 


gaaaacccac 


tgtacgcgat 


cacatcagag 


3780 


catgtgaacg 


tgttgctcac 


ccgcactgag 


gacaggctag 


tgtggaaaac 


cttgcagggc 


3840 


gacccatgga 


ttaagcagct 


cactaacata 


cctaaaggaa 


actttcaggc 


tactatagag 


3900 


gactgggaag 


ctgaacacaa 


gggaataatt 


gctgcaataa 


acagccccac 


tccccgtgcc 


3960 


aatccgttca 


gctgcaagac 


caacgtttgc 


tgggcgaaag 


cattggaacc 


gatactagcc 


4020 


acggccggta 


tcgtacttac 


cggttgccag 


tggagcgaac 


tgttcccaca 


gtttgcggat 


4080 


gacaaaccac 


attcggccat 


ttacgcctta 


gacgtaattt 


gcattaagtt 


tttcggcatg 


4140 


gacttgacaa 


gcggactgtt 


ttctaaacag 


agcatcccac 


taacgtacca 


tcccgccgat 


4200 


tcagcgaggc 


cggtagctca 


ttgggacaac 


agcccaggaa 


cccgcaagta 


tgggtacgat 


4260 


cacgccattg 


ccgccgaact 


ctcccgtaga 


tttccggtgt 


tccagctagc 


tgggaagggc 


4320 


acacaacttg 


atttgcagac 


ggggagaacc 


agagttatct 


ctgcacagca 


taacctggtc 


4380 


ccggtgaacc 


gcaatcttcc 


tcacgcctta 


gtccccgagt 


acaaggagaa 


gcaacccggc 


4440 


ccggtcgaaa 


aattcttgaa 


ccagttcaaa 


caccactcag 


tacttgtggt 


atcagaggaa 


4500 
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aaaattgaag 


ctccccgtaa 


gagaatcgaa 


tggatcgccc 


cgattggcat 


agccggtgca 


4560 


gataagaact 


acaacctggc 


tttcgggttt 


ccgccgcagg 


cacggtacga 


cctggtgttc 


4620 


atcaacattg 


gaactaaata 


cagaaaccac 


cactttcagc 


agtgcgaaga 


ccatgcggcg 


4680 


accttaaaaa 


ccctttcgcg 


ttcggccctg 


aattgtttaa 


actcaggagg 


caccctcgtg 


4740 


gtgaagtcct 


atggctacgc 


cgaccgcaac 


agtgaggacg 


tagtcaccgc 


tcttgccaga 


4800 


aagtttgtca 


gggtgtctgc 


agcgagacca 


gattgtgtct 


caagcaatac 


agaaatgtac 


4860 


ctgattttcc 


gacaactaga 


caacagccgt 


acacggcaat 


tcaccccgca 


ccatctgaat 


4920 


tgcgtgattt 


cgtccgtgta 


tgagggtaca 


agagatggag 


ttggagccgc 


gccgtcatac 


4980 


cgcaccaaaa 


gggagaatat 


tgctgactgt 


caagaggaag 


cagttgtcaa 


cgcagccaat 


5040 


ccgctgggta 


gaccaggcga 


aggagtctgc 


cgtgccatct 


ataaacgttg 


gccgaccagt 


5100 


tttaccgatt 


cagccacgga 


gacaggcacc 


gcaagaatga 


ctgtgtgcct 


aggaaagaaa 


5160 


gtgatccacg 


cggtcggccc 


tgatttccgg 


aagcacccag 


aagcagaagc 


cttgaaattg 


5220 


ctacaaaacg 


cctaccatgc 


agtggcagac 


ttagtaaatg 


aacataacat 


caagtctgtc 


5280 


gccattccac 


tgctatctac 


aggcatttac 


gcagccggaa 


aagaccgcct 


tgaagtatca 


5340 


cttaactgct 


tgacaaccgc 


gctagacaga 


actgacgcgg 


acgtaaccat 


ctattgcctg 


5400 


gataagaagt 


ggaaggaaag 


aatcgacgcg 


gcactccaac 


ttaaggagtc 


tgtaacagag 


5460 


ctgaaggatg 


aagatatgga 


gatcgacgat 


gagttagtat 


ggattcatcc 


agacagttgc 


5520 


ttgaagggaa 


gaaagggatt 


cagtactaca 


aaaggaaaat 


tgtattcgta 


cttcgaaggc 


5580 


accaaattcc 


atcaagcagc 


aaaagacatg 


gcggagataa 


aggtcctgtt 


ccctaatgac 


5640 


caggaaagta 


atgaacaact 


gtgtgcctac 


atattgggtg 


agaccatgga 


agcaatccgc 


5700 


gaaaagtgcc 


cggtcgacca 


taacccgtcg 


tctagcccgc 


ccaaaacgtt 


gccgtgcctt 


5760 


tgcatgtatg 


ccatgacgcc 


agaaagggtc 


cacagactta 


gaagcaataa 


cgtcaaagaa 


5820 


gttacagtat 


gctcctccac 


cccccttcct 


aagcacaaaa 


ttaagaatgt 


tcagaaggtt 


5880 


cagtgcacga 


aagtagtcct 


gtttaatccg 


cacactcccg 


cattcgttcc 


cgcccgtaag 


5940 


tacatagaag 


tgccagaaca 


gcctaccgct 


cctcctgcac 


aggccgagga 


ggcccccgaa 


6000 


gttgtagcga 


caccgtcacc 


atctacagct 


gataacacct 


cgcttgatgt 


cacagacatc 


6060 


tcactggata 


tggatgacag 


tagcgaaggc 


tcactttttt 


cgagctttag 


cggatcggac 


6120 


aactctatta 


ctagtatgga 


cagttggtcg 


tcaggaccta 


gttcactaga 


gatagtagac 


6180 


cgaaggcagg 


tggtggtggc 


tgacgttcat 


gccgtccaag 


agcctgcccc 


tattccaccg 


6240 


ccaaggctaa 


agaagatggc 


ccgcctggca 


gcggcaagaa 


aagagcccac 


tccaccggca 


6300 


agcaatagct 


ctgagtccct 


ccacctctct 


tttggtgggg 


tatccatgtc 


cctcggatca 


6360 
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attttcgacg 


gagagacggc 


ccgccaggca 


gcggtacaac 


ccctggcaac 


aggccccacg 


6420 


gatgtgccta 


tgtctttcgg 


atcgttttcc 


gacggagaga 


ttgatgagct 


gagccgcaga 


6480 


gtaactgagt 


ccgaacccgt 


cctgtttgga 


tcatttgaac 


cgggcgaagt 


gaactcaatt 


6540 


atatcgtccc 


gatcagccgt 


atcttttcct 


ctacgcaagc 


agagacgtag 


acgcaggagc 


6600 


aggaggactg 


aatactgact 


aaccggggta 


ggtgggtaca 


tattttcgac 


ggacacaggc 


6660 


cctgggcact 


tgcaaaagaa 


gtccgttctg 


cagaaccagc 


ttacagaacc 


gaccttggag 


6720 


cgcaatgtcc 


tggaaagaat 


tcatgccccg 


gtgctcgaca 


cgtcgaaaga 


ggaacaactc 


6780 


aaactcaggt 


accagatgat 


gcccaccgaa 


gccaacaaaa 


gtaggtacca 


gtctcgtaaa 


6840 


gtagaaaatc 


agaaagccat 


aaccactgag 


cgactactgt 


caggactacg 


actgtataac 


6900 


tctgccacag 


atcagccaga 


atgctataag 


atcacctatc 


cgaaaccatt 


gtactccagt 


6960 


agcgtaccgg 


cgaactactc 


cgatccacag 


ttcgctgtag 


ctgtctgtaa 


caactatctg 


7020 


catgagaact 


atccgacagt 


agcatcttat 


cagattactg 


acgagtacga 


tgcttacttg 


7080 


gatatggtag 


acgagacagt 


cgcatgcctg 


gatactgcaa 


ccttctgccc 


cgctaagctt 


7140 


agaagttacc 


cgaaaaaaca 


tgagtataga 


gccccgaata 


tccgcagtgc 


ggttccatca 


7200 


gcgatgcaga 


acacgctaca 


aaatgtgctc 


attgccgcaa 


ctaaaagaaa 


ttgcaacgtc 


7260 


acgcagatgc 


gtgaactgcc 


aacactggac 


t cagcgacat 


tcaatgtcga 


atgctttcga 


7320 


aaatatgcat 


gtaatgacga 


gtattgggag 


gagttcgctc 


ggaagccaat 


taggattacc 


7380 


actgagtttg 


tcaccgcata 


tgtagctaga 


ctgaaaggcc 


ctaaggccgc 


cgcactattt 


7440 


gcaaagacgt 


ataatttggt 


cccattgcaa 


gaagtgccta 


tggatagatt 


cgtcatggac 


7500 


atgaaaagag 


acgtgaaagt 


tacaccaggc 


acgaaacaca 


cagaagaaag 


accgaaagta 


7560 


caagtgatac 


aagccgcaga 


acccctggcg 


actgcttact 


tatgcgggat 


tcaccgggaa 


7620 


ttagtgcgta 


ggcttacggc 


cgtcttgctt 


ccaaacattc 


acacgctttt 


tgacatgtcg 


7680 


gcggaggatt 


ttgatgcaat 


catagcagaa 


cacttcaagc 


aaggcgaccc 


ggtactggag 


7740 


acggatatcg 


catcattcga 


caaaagccaa 


gacgacgcta 


tggcgttaac 


cggtctgatg 


7800 


atcttggagg 


acctgggtgt 


ggatcaacca 


ctactcgact 


tgatcgagtg 


cgcctttgga 


7860 


gaaatatcat 


ccacccatct 


acctacgggt 


actcgtttta 


aattcggggc 


gatgatgaaa 


7920 


tccggaatgt 


tcctcacact 


ttttgtcaac 


acagttttga 


atgtcgttat 


cgccagcaga 


7980 


gtactagaag 


agcggcttaa 


aacgtccaga 


tgtgcagcgt 


tcattggcga 


cgacaacatc 


8040 


atacatggag 


tagtatctga 


caaagaaatg 


gctgagaggt 


gcgccacctg 


gctcaacatg 


8100 


gaggttaaga 


tcatcgacgc 


agtcatcggt 


gagagaccac 


cttacttctg 


cggcggattt 


8160 


atcttgcaag 


attcggttac 


ttccacagcg 


tgccgcgtgg 


cggatcccct 


gaaaaggctg 


8220 


tttaagttgg 


gtaaaccgct 


cccagccgac 


gacgagcaag 


acgaagacag 


aagacgcgct 


8280 
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ctgctagatg 
gtgacgaccc 
gcccagagca 
cctaaatagt 
ttaattaact 
gtgtgacata 
ttttaagtgt 
cctatggaac 
gctcagaaga 
ctccaaaaaa 
ttttgagtca 
aggaaaaagc 
atgatccgac 
ctggtacatt 
ttccgaggaa 
atttatctag 
acgcagcgtc 
catttcaaaa 
tgcagcttat 
tttttcactg 
gatccgtcga 
gcagaagtat 
gctccccagc 
cgcccctaac 
atggctgact 
tccagaagta 
tgcaggtcgg 
cacaaggaga 
cgacgtcccc 
ccacaccgtc 
cacgcgcgtc 



aaacaaaggc 
ggtatgaggt 
aaagagcatt 
cagcatagta 
cgaggcgcgc 
attggacaaa 
ataatgtgtt 
tgatgaatgg 
aatgccatct 
gaagagaaag 
tgctgtgttt 
tgcactgcta 
cagcaaaact 
agatccccgc 
gcgcagtgca 
cggacgccaa 
tgcataactt 
aaaaaaaaaa 
aatggttaca 
cattctagtt 
gacgcgtgcg 
gcaaagcatg 
aggcagaagt 
tccgcccatc 
aatttttttt 
gtgaggaggc 
ccgccacgac 
cgaccttcca 
cgggccgtac 
gacccggacc 
gggctcgaca 



gtggtttaga 
agacaatatt 
ccaagccatc 
catttcatct 
cgggcccaga 
ctacctacag 
aaactactga 
gagcagtggt 
agtgatgatg 
gtagaagacc 
agtaatagaa 
tacaagaaaa 
cgatgtactt 
ttaccgcggg 
taatgctgcg 
aaactcaatg 
ttattatttc 
aaaaaaaaaa 
aataaagcaa 
gtggtttgtc 
gccgcgttag 
catctcaatt 
atgcaaagca 
ccgcccctaa 
atttatgcag 
ttttttggag 
cggtgccgcc 
tgaccgagta 
gcaccctcgc 
gccacatcga 
tcggcaaggt 



gtaggtataa 
acacctgtcc 
agaggggaaa 
gactaatacc 
ggatctttgt 
agatttaaag 
ttctaattgt 
ggaatgcctt 
aggctactgc 
ccaaggactt 
ctcttgcttg 
ttatggaaaa 
ccgaggaact 
caatatagca 
cagtgttgcc 
tatttctgag 
ttttattaat 
aaaaaaaagg 
tagcatcaca 
caaactcatc 
ggtgtggaaa 
agtcagcaac 
tgcatctcaa 
ctccgcccag 
aggccgaggc 
gcctaggctt 
accatcccct 
caagcccacg 
cgccgcgttc 
gcgggtcacc 
gtgggtcgcg 



caggcacttt 
tactggcatt 
taaagcatct 
acaacaccac 
gaaggaacct 
ctctaaggta 
ttgtgtattt 
taatgaggaa 
tgactctcaa 
tccttcagaa 
ctttgctatt 
atatttgatg 
gatgtgcata 
acactaaaaa 
acataaccac 
gaagcgtggt 
caacaaaatt 
ggaattccca 
aatttcacaa 
aatgtatctt 
gtccccaggc 
caggtgtgga 
ttagtcagca 
ttccgcccat 
cgcctcggcc 
ttgcaaaaag 
gacccacgcc 
gtgcgcctcg 
gccgactacc 
gagctgcaag 
gacgacggcg 



agcagtggcc 
gagaactttt 
ctacggtggt 
cacctctaga 
tacttctgtg 
aatataaaat 
tagattccaa 
aacctgtttt 
cattctactc 
ttgctaagtt 
tacaccacaa 
tagcggccca 
atgcatcagg 
ctcgatgtac 
tatattaacc 
gcataatgcc 
ttgtttttaa 
acttgtttat 
ataaagcatt 
atcatgtctg 
tccccagcag 
aagtccccag 
accatagtcc 
t ctccgcccc 
tctgagctat 
cttgcatgcc 
cctgacccct 
ccacccgcga 
ccgccacgcg 
aactcttcct 
ccgcggtggc 



8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 



13 



ggtctggacc 


acgccggaga 


gcgtcgaagc 


gggggcggtg 


ttcgccgaga 


tcggcccgcg 


10200 


catggccgag 


ttgagcggtt 


cccggctggc 


cgcgcagcaa 


cagatggaag 


gcctcctggc 


10260 


gccgcaccgg 


cccaaggagc 


ccgcgtggtt 


cctggccacc 


gtcggcgtct 


cgcccgacca 


10320 


ccagggcaag 


ggtctgggca 


gcgccgtcgt 


gctccccgga 


gtggaggcgg 


ccgagcgcgc 


10380 


cggggtgccc 


gccttcctgg 


agacctccgc 


gccccgcaac 


ctccccttct 


acgagcggct 


10440 


cggcttcacc 


gtcaccgccg 


acgtcgaggt 


gcccgaagga 


ccgcgcacct 


ggtgcatgac 


10500 


ccgcaagccc 


ggtgcctgac 


gcccgcccca 


cgacccgcag 


cgcccgaccg 


aaaggagcgc 


10560 


acgaccccat 


ggctccgacc 


gaagccaccc 


ggggcggccc 


cgccgacccc 


gcacccgccc 


10620 


ccgaggccca 


ccgactctag 


ctagaggatc 


ataatcagcc 


ataccacatt 


tgtagaggtt 


10680 


ttacttgctt 


taaaaaacct 


cccacacctc 


cccctgaacc 


tgaaacataa 


aatgaatgca 


10740 


attgttgttg 


ttaacttgtt 


tattgcagct 


tataatggtt 


acaaataaag 


caatagcatc 


10800 


acaaatttca 


caaataaagc 


atttttttca 


ctgcattcta 


gttgtggttt 


gtccaacgcg 


10860 


tcggtaccag 


cttttgttcc 


ctttagtgag 


ggttaatttc 


gagcttggcg 


taatcatggt 


10920 


catagctgtt 


tcctgtgtga 


aattgttatc 


cgctcacaat 


tccacacaac 


atacgagccg 


10980 


gaagcataaa 


gtgtaaagcc 


tggggtgcct 


aatgagtgag 


ctaactcaca 


ttaattgcgt 


11040 


ttcgctcact 


gcccgctttc 


cagtcgggaa 


acctgtcgtg 


ccagctgcat 


taatgaatcg 


11100 


gccaacgcgc 


ggggagaggc 


ggtttgcgta 


ttgggcgctc 


ttccgcttcc 


tcgctcactg 


11160 


actcgctgcg 


ctcggtcgtt 


cggctgcggc 


gagcggtatc 


agctcactca 


aaggcggtaa 


11220 


tacggttatc 


cacagaatca 


ggggataacg 


caggaaagaa 


catgtgagca 


aaaggccagc 


11280 


aaaaggccag 


gaaccgtaaa 


aaggccgcgt 


tgctggcgtt 


tttccatagg 


ctccgccccc 


11340 


ctgacgagca 


tcacaaaaat 


cgacgctcaa 


gtcagaggtg 


gcgaaacccg 


acaggactat 


11400 


aaagatacca 


ggcgtttccc 


cctggaagct 


ccctcgtgcg 


ctctcctgtt 


ccgaccctgc 


11460 


cgcttaccgg 


atacctgtcc 


gcctttctcc 


cttcgggaag 


cgtggcgctt 


tctcatagct 


11520 


cacgctgtag 


gtatctcagt 


tcggtgtagg 


tcgttcgctc 


caagctgggc 


tgtgtgcacg 


11580 


aaccccccgt 


tcagcccgac 


cgctgcgcct 


tatccggtaa 


ctatcgtctt 


gagtccaacc 


11640 


cggtaagaca 


cgacttatcg 


ccactggcag 


cagccactgg 


taacaggatt 


agcagagcga 


11700 


ggtatgtagg 


cggtgctaca 


gagttcttga 


agtggtggcc 


taactacggc 


tacactagaa 


11760 


gaacagtatt 


tggtatctgc 


gctctgctga 


agccagttac 


cttcggaaaa 


agagttggta 


11820 


gctcttgatc 


cggcaaacaa 


accaccgctg 


gtagcggtgg 


tttttttgtt 


tgcaagcagc 


11880 


agattacgcg 


cagaaaaaaa 


ggatctcaag 


aagatccttt 


gatcttttct 


acggggtctg 


11940 


acgctcagtg 


gaacgaaaac 


tcacgttaag 


ggattttggt 


catgagatta 


tcaaaaagga 


12000 


tcttcaccta 


gatcctttta 


aattaaaaat 


gaagttttaa 


atcaatctaa 


agtatatatg 


12060 



■• 14 



agtaaacttg 


gtctgacagt 


taccaatgct 


taatcagtga 


ggcacctatc 


tcagcgatct 


12120 


gtctatttcg 


ttcatccata 


gttgcctgac 


tccccgtcgt 


gtagataact 


acgatacggg 


12180 


agggcttacc 


atctggcccc 


agtgctgcaa 


tgataccgcg 


agacccacgc 


tcaccggctc 


1224 0 


cagatttatc 


agcaataaac 


cagccagccg 


gaagggccga 


gcgcagaagt 


ggtcctgcaa 


12300 


ctttatccgc 


ctccatccag 


tctattaatt 


gttgccggga 


agctagagta 


agtagttcgc 


12360 


cagttaatag 


tttgcgcaac 


gttgttgcca 


ttgctacagg 


catcgtggtg 


tcacgctcgt 


12420 


cgtttggtat 


ggcttcattc 


agctccggtt 


cccaacgatc 


aaggcgagtt 


acatgatccc 


12480 


ccatgttgtg 


caaaaaagcg 


gttagctcct 


tcggtcctcc 


gatcgttgtc 


agaagtaagt 


12540 


tggccgcagt 


gttatcactc 


atggttatgg 


cagcactgca 


taattctctt 


actgtcatgc 


12600 


catccgtaag 


atgcttttct 


gtgactggtg 


agtactcaac 


caagtcattc 


tgagaatagt 


12660 


gtatgcggcg 


accgagttgc 


tcttgcccgg 


cgtcaatacg 


ggataatacc 


gcgccacata 


12720 


gcagaacttt 


aaaagtgctc 


atcattggaa 


aacgttcttc 


ggggcgaaaa 


ctctcaagga 


12780 


tcttaccgct 


gttgagatcc 


agttcgatgt 


aacccactcg 


tgcacccaac 


tgatcttcag 


12840 


catcttttac 


tttcaccagc 


gtttctgggt 


gagcaaaaac 


aggaaggcaa 


aatgccgcaa 


12900 


aaaagggaat 


aagggcgaca 


cggaaatgtt 


gaatactcat 


actcttcctt 


tttcaatatt 


12960 


attgaagcat 


ttatcagggt 


tattgtctca 


tgagcggata 


catatttgaa 


tgtatttaga 


13020 


aaaataaaca 


aataggggtt 


ccgcgcacat 


ttccccgaaa 


agtgccac 




13068 



<210> 3 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 3 

aacattgaaa tcgatattac agggg 25 

<210> 4 

<211> 20 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> primer 

<400> 4 

cgggttatgg tcgaccgggc 20 

<210> 5 

<211> 40 

<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> primer 
<400> 5 

gtgccctccc ctgagtttaa acaattcagg gccgaacgcg 40 



<210> 


6 


<211> 


32 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


primer 


<400> 


6 



gaattgttta aactcaggag gcaccctcgt gg 32 



<210> 7 

<211> 30 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> primer 

<400> 7 

ggtagacgag acagtcgcat gcctggatac 30 



<210> 8 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 8 

gtatccaggc atgcgactgt ctcgtctacc 30 



<210> 


9 


<211> 


25 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


primer 


<400> 


9 



cagaccggtt aacgccatag cgtcg 25 



<210> 10 

<211> 25 

<212> DNA 

<213> Artificial Sequence 



<220> 



16 



<223> primer 

<400> 10 

ctctattact agtatggaca gttgg 25 

<210> 11 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 11 

ctagattaat taactcgagg cgcgccg 27 

<210> 12 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 12 

ggcccggcgc gcctcgagtt aattaat 27 

<210> 13 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 13 

ataagaatgc cggcggatcc ggccattagc 30 

<210> 14 

<211> 42 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 14 

ccgtcaatac ggttcactaa acgagctctg cttatataga cc 42 



<210> 15 

<211> 38 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<400> 15 



17 



gctcgtttag tgaaccgtat tgacggcgta gtacacac 38 



<210> 16 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 16 

acgtcggcct caatttcgcg 20 



<210> 17 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 17 

gcgcgcgggc ccagaggatc tttgtgaagg 30 



<210> 18 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 

<400> 18 

gcgcgcgcgg ccgctacatc aaatattttt cc 

<210> 19 

<211> 39 

<212> DNA 

<213> Artificial Sequence 



32 



<220> 

<223> primer 

<400> 19 

acgtacgcgt gcggccgcgt tagggtgtgg aaagtcccc 39 

<210> 20 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<400> 20 

acgtacgcgt tggacaaacc acaactagaa tgc 



33 
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<210> 21 

<211> 15081 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> cDNA 

<400> 21 



ctgacgcgcc 


ctgtagcggc 


gcattaagcg 


cggcgggtgt 


ggtggttacg 


cgcagcgtga 


60 


ccgctacact 


tgccagcgcc 


ctagcgcccg 


ctcctttcgc 


tttcttccct 


tcctttctcg 


120 


ccacgttcgc 


cggcggatcc 


ggccattagc 


catattattc 


attggttata 


tagcataaat 


180 


caatattggc 


tattggccat 


tgcatacgtt 


gtatccatat 


cataatatgt 


acatttatat 


240 


tggctcatgt 


ccaacattac 


cgccatgttg 


acattgatta 


ttgactagtt 


attaatagta * 


300 


atcaattacg 


gggtcattag 


ttcatagccc 


atatatggag 


ttccgcgtta 


cataacttac 


360 


ggtaaatggc 


ccgcctggct 


gaccgcccaa 


cgacccccgc 


ccattgacgt 


caataatgac 


420 


gtatgttccc 


atagtaacgc 


caatagggac 


tttccattga 


cgtcaatggg 


tggagtattt 


480 


acggtaaact 


gcccacttgg 


cagtacatca 


agtgtatcat 


atgccaagta 


cgccccctat 


540 


tgacgtcaat 


gacggtaaat 


ggcccgcctg 


gcattatgcc 


cagtacatga 


ccttatggga 


600 


ctttcctact 


tggcagtaca 


tctacgtatt 


agtcatcgct 


attaccatgg 


tgatgcggtt 


660 


ttggcagtac 


atcaatgggc 


gtggatagcg 


gtttgactca 


cggggatttc 


caagtctcca 


720 


ccccattgac 


gtcaatggga 


gtttgttttg 


gcaccaaaat 


caacgggact 


ttccaaaatg 


780 


tcgtaacaac 


tccgccccat 


tgacgcaaat 


gggcggtagg 


cgtgtacggt 


gggaggtcta 


840 


tataagcaga 


gctcgtttag 


tgaaccgtat 


tgacggcgta 


gtacacacta 


ttgaatcaaa 


900 


cagccgacca 


attgcactac 


catcacaatg 


gagaagccag 


tagtaaacgt 


agacgtagac 


960 


ccccagagtc 


cgtttgtcgt 


gcaactgcaa 


aaaagcttcc 


cgcaatttga 


ggtagtagca 


1020 


cagcaggtca 


ctccaaatga 


ccatgctaat 


gccagagcat 


tttcgcatct 


ggccagtaaa 


1080 


ctaatcgagc 


tggaggttcc 


taccacagcg 


acgatcttgg 


acataggcag 


cgcaccggct 


1140 


cgtagaatgt 


tttccgagca 


ccagtatcat 


tgtgtctgcc 


ccatgcgtag 


tccagaagac 


1200 


ccggaccgca 


tgatgaaata 


tgccagtaaa 


ctggcggaaa 


aagcgtgcaa 


gattacaaac 


1260 


aagaacttgc 


atgagaagat 


taaggatctc 


cggaccgtac 


ttgatacgcc 


ggatgctgaa 


1320 


acaccatcgc 


tctgctttca 


caacgatgtt 


acctgcaaca 


tgcgtgccga 


atattccgtc 


1380 


atgcaggacg 


tgtatatcaa 


cgctcccgga 


actatctatc 


atcaggctat 


gaaaggcgtg 


1440 


cggaccctgt 


actggattgg 


cttcgacacc 


acccagttca 


tgttctcggc 


tatggcaggt 


1500 


tcgtaccctg 


cgtacaacac 


caactgggcc 


gacgagaaag 


tccttgaagc 


gcgtaacatc 


1560 


ggactttgca 


gcacaaagct 


gagtgaaggt 


aggacaggaa 


aattgtcgat 


aatgaggaag 


1620 
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aaggagttga 


agcccgggtc 


gcgggtttat 


ttctccgtag 


gatcgacact 


ttatccagaa 


1680 


cacagagcca 


gcttgcagag 


ctggcatctt 


ccatcggtgt 


tccacttgaa 


tggaaagcag 


1740 


tcgtacactt 


gccgctgtga 


tacagtggtg 


agttgcgaag 


gctacgtagt 


gaagaaaatc 


1800 


accatcagtc 


ccgggatcac 


gggagaaacc 


gtgggatacg 


cggttacaca 


caatagcgag 


1860 


ggcttcttgc 


tatgcaaagt 


tactgacaca 


gtaaaaggag 


aacgggtatc 


gttccctgtg 


1920 


tgcacgtaca 


tcccggccac 


catatgcgat 


cagatgactg 


gtataatggc 


cacggatata 


1980 


tcacctgacg 


atgcacaaaa 


acttctggtt 


gggctcaacc 


agcgaattgt 


cattaacggt 


2040 


aggactaaca 


ggaacaccaa 


caccatgcaa 


aattaccttc 


tgccgatcat 


agcacaaggg 


2100 


ttcagcaaat 


gggctaagga 


gcgcaaggat 


gatcttgata 


acgagaaaat 


gctgggtact 


2160 


agagaacgca 


agcttacgta 


tggctgcttg 


tgggcgtttc 


gcactaagaa 


agtacattcg 


2220 


ttttatcgcc 


cacctggaac 


gcagacctgc 


gtaaaagtcc 


cagcctcttt 


tagcgctttt 


2280 


cccatgtcgt 


ccgtatggac 


gacctctttg 


cccatgtcgc 


tgaggcagaa 


attgaaactg 


2340 


gcattgcaac 


caaagaagga 


ggaaaaactg 


ctgcaggtct 


cggaggaatt 


agtcatggag 


2400 


gccaaggctg 


cttttgagga 


tgctcaggag 


gaagccagag 


cggagaagct 


ccgagaagca 


2460 


cttccaccat 


tagtggcaga 


caaaggcatc 


gaggcagccg 


cagaagttgt 


ctgcgaagtg 


2520 


gaggggctcc 


aggcggacat 


cggagcagca 


ttagttgaaa 


ccccgcgcgg 


tcacgtaagg 


2580 


ataatacctc 


aagcaaatga 


ccgtatgatc 


ggacagtata 


tcgttgtctc 


gccaaactct 


2640 


gtgctgaaga 


atgccaaact 


cgcaccagcg 


cacccgctag 


cagatcaggt 


taagatcata 


2700 


acacactccg 


gaagatcagg 


aaggtacgcg 


gtcgaaccat 


acgacgctaa 


agtactgatg 


2760 


ccagcaggag 


gtgccgtacc 


atggccagaa 


ttcctagcac 


tgagtgagag 


cgccacgtta 


2820 


gtgtacaacg 


aaagagagtt 


tgtgaaccgc 


aaactatacc 


acattgccat 


gcatggcccc 


2880 


gccaagaata 


cagaagagga 


gcagtacaag 


gttacaaagg 


cagagcttgc 


agaaacagag 


2940 


tacgtgtttg 


acgtggacaa 


gaagcgttgc 


gttaagaagg 


aagaagcctc 


aggtctggtc 


3000 


ctctcgggag 


aactgaccaa 


ccctccctat 


catgagctag 


ctctggaggg 


actgaagacc 


3060 


cgacctgcgg 


tcccgtacaa 


ggtcgaaaca 


ataggagtga 


taggcacacc 


ggggtcgggc 


3120 


aagtcagcta 


ttatcaagtc 


aactgtcacg 


gcacgagatc 


ttgttaccag 


cggaaagaaa 


3180 


gaaaattgtc 


gcgaaattga 


ggccgacgtg 


ctaagactga 


ggggtatgca 


gattacgtcg 


3240 


aagacagtag 


attcggttat 


gctcaacgga 


tgccacaaag 


ccgtagaagt 


gctgtacgtt 


3300 


gacgaagcgt 


tcgcgtgcca 


cgcaggagca 


ctacttgcct 


tgattgctat 


cgtcaggccc 


3360 


cgcaagaagg 


tagtactatg 


cggagacccc 


atgcaatgcg 


gattcttcaa 


catgatgcaa 


3420 


ctaaaggtac 


atttcaatca 


ccctgaaaaa 


gacatatgca 


ccaagacatt 


ctacaagtat 


3480 


atctcccggc 


gttgcacaca 


gccagttaca 


gctattgtat 


cgacactgca 


ttacgatgga 


3540 
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aagatgaaaa 


ccacgaaccc 


gtgcaagaag 


aacattgaaa 


tcgatattac 


aggggccaca 


3600 


aagccgaagc 


caggggatat 


catcctgaca 


tgtttccgcg 


ggtgggttaa 


gcaattgcaa 


3660 


atcgactatc 


ccggacatga 


agtaatgaca 


gccgcggcct 


cacaagggct 


aaccagaaaa 


3720 


ggagtgtatg 


ccgtccggca 


aaaagtcaat 


gaaaacccac 


tgtacgcgat 


cacatcagag 


3780 


catgtgaacg 


tgttgctcac 


ccgcactgag 


gacaggctag 


tgtggaaaac 


cttgcagggc 


3840 


gacccatgga 


ttaagcagct 


cactaacata 


cctaaaggaa 


actttcaggc 


tactatagag 


3900 


gactgggaag 


ctgaacacaa 


gggaataatt 


gctgcaataa 


acagccccac 


tccccgtgcc 


3960 


aatccgttca 


gctgcaagac 


caacgtttgc 


tgggcgaaag 


cattggaacc 


gatactagcc 


4020 


acggccggta 


tcgtacttac 


cggttgccag 


tggagcgaac 


tgttcccaca 


gtttgcggat 


4080 


gacaaaccac 


attcggccat 


ttacgcctta 


gacgtaattt 


gcattaagtt 


tttcggcatg 


4140 


gacttgacaa 


gcggactgtt 


ttctaaacag 


agcatcccac 


taacgtacca 


tcccgccgat 


4200 


tcagcgaggc 


cggtagctca 


ttgggacaac 


agcccaggaa 


cccgcaagta 


tgggtacgat 


4260 


cacgccattg 


ccgccgaact 


ctcccgtaga 


tttccggtgt 


tccagctagc 


tgggaagggc 


4320 


acacaacttg 


atttgcagac 


ggggagaacc 


agagttatct 


ctgcacagca 


taacctggtc 


4380 


ccggtgaacc 


gcaatcttcc 


tcacgcctta 


gtccccgagt 


acaaggagaa 


gcaacccggc 


4440 


ccggtcgaaa 


aattcttgaa 


ccagttcaaa 


caccactcag 


tacttgtggt 


atcagaggaa 


4500 


aaaattgaag 


ctccccgtaa 


gagaatcgaa 


tggatcgccc 


cgattggcat 


agccggtgca 


4560 


gataagaact 


acaacctggc 


tttcgggttt 


ccgccgcagg 


cacggtacga 


cctggtgttc 


4 620 


atcaacattg 


gaactaaata 


cagaaaccac 


cactttcagc 


agtgcgaaga 


ccatgcggcg 


4 680 


accttaaaaa 


ccctttcgcg 


ttcggccctg 


aattgtttaa 


actcaggagg 


caccctcgtg 


4740 


gtgaagtcct 


atggctacgc 


cgaccgcaac 


agtgaggacg 


tagtcaccgc 


tcttgccaga 


4800 


aagtttgtca 


gggtgtctgc 


agcgagacca 


gattgtgtct 


caagcaatac 


agaaatgtac 


4860 


ctgattttcc 


gacaactaga 


caacagccgt 


acacggcaat 


tcaccccgca 


ccatctgaat 


4 920 


tgcgtgattt 


cgtccgtgta 


tgagggtaca 


agagatggag 


ttggagccgc 


gccgtcatac 


4 980 


cgcaccaaaa 


gggagaatat 


tgctgactgt 


caagaggaag 


cagttgtcaa 


cgcagccaat 


504 0 


ccgctgggta 


gaccaggcga 


aggagtctgc 


cgtgccatct 


ataaacgttg 


gccgaccagt 


5100 


tttaccgatt 


cagccacgga 


gacaggcacc 


gcaagaatga 


ctgtgtgcct 


aggaaagaaa 


5160 


gtgatccacg 


cggtcggccc 


tgatttccgg 


aagcacccag 


aagcagaagc 


cttgaaattg 


5220 


ctacaaaacg 


cctaccatgc 


agtggcagac 


ttagtaaatg 


aacataacat 


caagtctgtc 


5280 


gccattccac 


tgctatctac 


aggcatttac 


gcagccggaa 


aagaccgcct 


tgaagtatca 


5340 


cttaactgct 


tgacaaccgc 


gctagacaga 


actgacgcgg 


acgtaaccat 


ctattgcctg 


5400 
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gataagaagt 


ggaaggaaag 


aatcgacgcg 


gcactccaac 


ttaaggagtc 


tgtaacagag 


5460 


ctgaaggatg 


aagatatgga 


gatcgacgat 


gagttagtat 


ggattcatcc 


agacagttgc 


5520 


ttgaagggaa 


gaaagggatt 


cagtactaca 


aaaggaaaat 


tgtattcgta 


cttcgaaggc 


5580 


accaaattcc 


atcaagcagc 


aaaagacatg 


gcggagataa 


aggtcctgtt 


ccctaatgac 


5640 


caggaaagta 


atgaacaact 


gtgtgcctac 


atattgggtg 


agaccatgga 


agcaatccgc 


5700 


gaaaagtgcc 


cggtcgacca 


taacccgtcg 


tctagcccgc 


ccaaaacgtt 


gccgtgcctt 


5760 


tgcatgtatg 


ccatgacgcc 


agaaagggtc 


cacagactta 


gaagcaataa 


cgtcaaagaa 


5820 


gttacagtat 


gctcctccac 


cccccttcct 


aagcacaaaa 


ttaagaatgt 


tcagaaggtt 


5880 


cagtgcacga 


aagtagtcct 


gtttaatccg 


cacactcccg 


cattcgttcc 


cgcccgtaag 


5940 


tacatagaag 


tgccagaaca 


gcctaccgct 


cctcctgcac 


aggccgagga 


ggcccccgaa 


6000 


gttgtagcga 


caccgtcacc 


atctacagct 


gataacacct 


cgcttgatgt 


cacagacatc 


6060 


tcactggata 


tggatgacag 


tagcgaaggc 


tcactttttt 


cgagctttag 


cggatcggac 


6120 


aactctatta 


ctagtatgga 


cagttggtcg 


tcaggaccta 


gttcactaga 


gatagtagac 


6180 


cgaaggcagg 


tggtggtggc 


tgacgttcat 


gccgtccaag 


agcctgcccc 


tattccaccg 


6240 


ccaaggctaa 


agaagatggc 


ccgcctggca 


gcggcaagaa 


aagagcccac 


tccaccggca 


6300 


agcaatagct 


ctgagtccct 


ccacctctct 


tttggtgggg 


tatccatgtc 


cctcggatca 


6360 


attttcgacg 


gagagacggc 


ccgccaggca 


gcggtacaac 


ccctggcaac 


aggccccacg 


6420 


gatgtgccta 


tgtctttcgg 


atcgttttcc 


gacggagaga 


ttgatgagct 


gagccgcaga 


6480 


gtaactgagt 


ccgaacccgt 


cctgtttgga 


tcatttgaac 


cgggcgaagt 


gaactcaatt 


6540 


atatcgtccc 


gatcagccgt 


atcttttcct 


ctacgcaagc 


agagacgtag 


acgcaggagc 


6600 


aggaggactg 


aatactgact 


aaccggggta 


ggtgggtaca 


tattttcgac 


ggacacaggc 


6660 


cctgggcact 


tgcaaaagaa 


gtccgttctg 


cagaaccagc 


ttacagaacc 


gaccttggag 


6720 


cgcaatgtcc 


tggaaagaat 


tcatgccccg 


gtgctcgaca 


cgtcgaaaga 


ggaacaactc 


6780 


aaactcaggt 


accagatgat 


gcccaccgaa 


gccaacaaaa 


gtaggtacca 


gtctcgtaaa 


6840 


gtagaaaatc 


agaaagccat 


aaccactgag 


cgactactgt 


caggactacg 


actgtataac 


6900 


tctgccacag 


atcagccaga 


atgctataag 


atcacctatc 


cgaaaccatt 


gtactccagt 


6960 


agcgtaccgg 


cgaactactc 


cgatccacag 


ttcgctgtag 


ctgtctgtaa 


caactatctg 


7020 


catgagaact 


atccgacagt 


agcatcttat 


cagattactg 


acgagtacga 


tgcttacttg 


7080 


gatatggtag 


acgagacagt 


cgcatgcctg 


gatactgcaa 


ccttctgccc 


cgctaagctt 


7140 


agaagttacc 


cgaaaaaaca 


tgagtataga 


gccccgaata 


tccgcagtgc 


ggttccatca 


7200 


gcgatgcaga 


acacgctaca 


aaatgtgctc 


attgccgcaa 


ctaaaagaaa 


ttgcaacgtc 


7260 


acgcagatgc 


gtgaactgcc 


aacactggac 


tcagcgacat 


tcaatgtcga 


atgctttcga 


7320 
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aaatatgcat 


gtaatgacga 


gtattgggag 


gagttcgctc 


ggaagccaat 


taggattacc 


7380 


actgagtttg 


tcaccgcata 


tgtagctaga 


ctgaaaggcc 


ctaaggccgc 


cgcactattt 


74 40 


gcaaagacgt 


ataatttggt 


cccattgcaa 


gaagtgccta 


tggatagatt 


cgtcatggac 


7500 


atgaaaagag 


acgtgaaagt 


tacaccaggc 


acgaaacaca 


cagaagaaag 


accgaaagta 


7560 


caagtgatac 


aagccgcaga 


acccctggcg 


actgcttact 


tatgcgggat 


tcaccgggaa 


7620 


ttagtgcgta 


ggcttacggc 


cgtcttgctt 


ccaaacattc 


acacgctttt 


tgacatgtcg 


7680 


gcggaggatt 


ttgatgcaat 


catagcagaa 


cacttcaagc 


aaggcgaccc 


ggtactggag 


7740 


acggatatcg 


catcattcga 


caaaagccaa 


gacgacgcta 


tggcgttaac 


cggtctgatg 


7800 


atcttggagg 


acctgggtgt 


ggatcaacca 


ctactcgact 


tgatcgagtg 


cgcctttgga 


7860 


gaaatatcat 


ccacccatct 


acctacgggt 


actcgtttta 


aattcggggc 


gatgatgaaa 


7920 


tccggaatgt 


tcctcacact 


ttttgtcaac 


acagttttga 


atgtcgttat 


cgccagcaga 


7980 


gtactagaag 


agcggcttaa 


aacgtccaga 


tgtgcagcgt 


tcattggcga 


cgacaacatc 


8040 


atacatggag 


tagtatctga 


caaagaaatg 


gctgagaggt 


gcgccacctg 


gctcaacatg 


8100 


gaggttaaga 


tcatcgacgc 


agtcatcggt 


gagagaccac 


cttacttctg 


cggcggattt 


8160 


atcttgcaag 


attcggttac 


ttccacagcg 


tgccgcgtgg 


cggatcccct 


gaaaaggctg 


8220 


tttaagttgg 


gtaaaccgct 


cccagccgac 


gacgagcaag 


acgaagacag 


aagacgcgct 


8280 


ctgctagatg 


aaacaaaggc 


gtggtttaga 


gtaggtataa 


caggcacttt 


agcagtggcc 


8340 


gtgacgaccc 


ggtatgaggt 


agacaatatt 


acacctgtcc 


tactggcatt 


gagaactttt 


8400 


gcccagagca 


aaagagcatt 


ccaagccatc 


agaggggaaa 


taaagcatct 


ctacggtggt 


8 4 60 


cctaaatagt 


cagcatagta 


catttcatct 


gactaatacc 


acaacaccac 


cacctctaga 


8520 


ttaattaact 


cgaggcgcgc 


cgggcccaga 


ggatctttgt 


gaaggaacct 


tacttctgtg 


8580 


gtgtgacata 


attggacaaa 


ctacctacag 


agatttaaag 


ctctaaggta 


aatataaaat 


864 0 


ttttaagtgt 


ataatgtgtt 


aaactactga 


ttctaattgt 


ttgtgtattt 


tagattccaa 


8700 


cctatggaac 


tgatgaatgg 


gagcagtggt 


ggaatgcctt 


taatgaggaa 


aacctgtttt 


87 60 


gctcagaaga 


aatgccatct 


agtgatgatg 


aggctactgc 


tgactctcaa 


cattctactc 


8820 


ctccaaaaaa 


gaagagaaag 


gtagaagacc 


ccaaggactt 


tccttcagaa 


ttgctaagtt 


8880 


ttttgagtca 


tgctgtgttt 


agtaatagaa 


ctcttgcttg 


ctttgctatt 


tacaccacaa 


8940 


aggaaaaagc 


tgcactgcta 


tacaagaaaa 


ttatggaaaa 


atatttgatg 


tagcggccca 


9000 


atgatccgac 


cagcaaaact 


cgatgtactt 


ccgaggaact 


gatgtgcata 


atgcatcagg 


9060 


ctggtacatt 


agatccccgc 


ttaccgcggg 


caatatagca 


acactaaaaa 


ctcgatgtac 


9120 


ttccgaggaa 


gcgcagtgca 


taatgctgcg 


cagtgttgcc 


acataaccac 


tatattaacc 


9180 
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atttatctag 


cggacgccaa 


aaactcaatg 


tatttctgag 


gaagcgtggt 


gcataatgcc 


9240 


acgcagcgtc 


tgcataactt 


ttattatttc 


ttttattaat 


caacaaaatt 


ttgtttttaa 


9300 


catttcaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaagg 


ggaattccca 


acttgtttat 


9360 


tgcagcttat 


aatggttaca 


aataaagcaa 


tagcatcaca 


aatttcacaa 


ataaagcatt 


9420 


tttttcactg 


cattctagtt 


gtggtttgtc 


caaactcatc 


aatgtatctt 


atcatgtctg 


9480 


gatccgtcga 


gacgcgtgcg 


gccgcgggaa 


ttcgatcttt 


atgtgtaact 


cttggctgaa 


9540 


gctcttacac 


caatgctggg 


ggacatgtac 


ctcccagggg 


cccaggaaga 


ctacgggagg 


9600 


ctacaccaac 


gtcaatcaga 


ggggcctgtg 


tagctaccga 


taagcggacc 


ctcaagaggg 


9660 


cattagcaat 


agtgtttata 


aggccccctt 


gttaacccta 


aacgggtagc 


atatgcttcc 


9720 


cgggtagtag 


tatatactat 


ccagactaac 


cctaattcaa 


tagcatatgt 


tacccaacgg 


9780 


gaagcatatg 


ctatcgaatt 


agggttagta 


aaagggtcct 


aaggaacagc 


gatatctccc 


9840 


accccatgag 


ctgtcacggt 


tttatttaca 


tggggtcagg 


attccacgag 


ggtagtgaac 


9900 


cattttagtc 


acaagggcag 


tggctgaaga 


tcaaggagcg 


ggcagtgaac 


tctcctgaat 


9960 


cttcgcctgc 


ttcttcattc 


tccttcgttt 


agctaataga 


ataactgctg 


agttgtgaac 


10020 


agtaaggtgt 


atgtgaggtg 


ctcgaaaaca 


aggtttcagg 


tgacgccccc 


agaataaaat 


10080 


ttggacgggg 


ggttcagtgg 


tggcattgtg 


ctatgacacc 


aatataaccc 


tcacaaaccc 


10140 


cttgggcaat 


aaatactagt 


gtaggaatga 


aacattctga 


atatctttaa 


caatagaaat 


10200 


ccatggggtg 


gggacaagcc 


gtaaagactg 


gatgtccatc 


tcacacgaat 


ttatggctat 


10260 


gggcaacaca 


taatcctagt 


gcaatatgat 


actggggtta 


ttaagatgtg 


tcccaggcag 


10320 


ggaccaagac 


aggtgaacca 


tgttgttaca 


ctctatttgt 


aacaagggga 


aagagagtgg 


10380 


acgccgacag 


cagcggactc 


cactggttgt 


ctctaacacc 


cccgaaaatt 


aaacggggct 


10440 


ccacgccaat 


ggggcccata 


aacaaagaca 


agtggccact 


cttttttttg 


aaattgtgga 


10500 


gtgggggcac 


gcgtcagccc 


ccacacgccg 


ccctgcggtt 


ttggactgta 


aaataagggt 


10560 


gtaataactt 


ggctgattgt 


aaccccgcta 


accactgcgg 


tcaaaccact 


tgcccacaaa 


10620 


accactaatg 


gcaccccggg 


gaatacctgc 


ataagtaggt 


gggcgggcca 


agataggggc 


10680 


gcgattgctg 


cgatctggag 


gacaaattac 


acacacttgc 


gcctgagcgc 


caagcacagg 


10740 


gttgttggtc 


ctcatattca 


cgaggtcgct 


gagagcacgg 


tgggctaatg 


ttgccatggg 


10800 


tagcatatac 


tacccaaata 


tctggatagc 


atatgctatc 


ctaatctata 


tctgggtagc 


10860 


ataggctatc 


ctaatctata 


tctgggtagc 


atatgctatc 


ctaatctata 


tctgggtagt 


10920 


atatgctatc 


ctaatttata 


tctgggtagc 


ataggctatc 


ctaatctata 


tctgggtagc 


10980 


atatgctatc 


ctaatctata 


tctgggtagt 


atatgctatc 


ctaatctgta 


tccgggtagc 


11040 


atatgctatc 


ctaatagaga 


ttagggtagt 


atatgctatc 


ctaatttata 


tctgggtagc 


11100 
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atatactacc 


caaatatctg 


gatagcatat 


gctatcctaa 


tctatatctg 


ggtagcatat 


11160 


gctatcctaa 


tctatatctg 


ggtagcatag 


gctatcctaa 


tctatatctg 


ggtagcatat 


11220 


gctatcctaa 


tctatatctg 


ggtagtatat 


gctatcctaa 


tttatatctg 


ggtagcatag 


11280 


gctatcctaa 


tctatatctg 


ggtagcatat 


gctatcctaa 


tctatatctg 


ggtagtatat 


11340 


gctatcctaa 


tctgtatccg 


ggtagcatat 


gctatcctca 


tgcatataca 


gtcagcatat 


11400 


gatacccagt 


agtagagtgg 


gagtgctatc 


ctttgcatat 


gccgccacct 


cccaaggggg 


11460 


cgtgaatttt 


cgctgcttgt 


ccttttcctg 


catgcatcac 


tagtgaattc 


gcggccgcgt 


11520 


tagggtgtgg 


aaagtcccca 


ggctccccag 


caggcagaag 


tatgcaaagc 


atgcatctca 


11580 


attagtcagc 


aaccaggtgt 


ggaaagtccc 


caggctcccc 


agcaggcaga 


agtatgcaaa 


11640 


gcatgcatct 


caattagtca 


gcaaccatag 


tcccgcccct 


aactccgccc 


atcccgcccc 


11700 


taactccgcc 


cagttccgcc 


cattctccgc 


cccatggctg 


actaattttt 


tttatttatg 


11760 


cagaggccga 


ggccgcctcg 


gcctctgagc 


tattccagaa 


gtagtgagga 


ggcttttttg 


11820 


gaggcctagg 


cttttgcaaa 


aagcttgcat 


gcctgcaggt 


cggccgccac 


gaccggtgcc 


11880 


gccaccatcc 


cctgacccac 


gcccctgacc 


cctcacaagg 


agacgacctt 


ccatgaccga 


11940 


gtacaagccc 


acggtgcgcc 


tcgccacccg 


cgacgacgtc 


ccccgggccg 


tacgcaccct 


12000 


cgccgccgcg 


ttcgccgact 


accccgccac 


gcgccacacc 


gtcgacccgg 


accgccacat 


12060 


cgagcgggtc 


accgagctgc 


aagaactctt 


cctcacgcgc 


gtcgggctcg 


acatcggcaa 


12120 


ggtgtgggtc 


gcggacgacg 


gcgccgcggt 


ggcggtctgg 


accacgccgg 


agagcgtcga 


12180 


agcgggggcg 


gtgttcgccg 


agatcggccc 


gcgcatggcc 


gagttgagcg 


gttcccggct 


12240 


ggccgcgcag 


caacagatgg 


aaggcctcct 


ggcgccgcac 


cggcccaagg 


agcccgcgtg 


12300 


gttcctggcc 


accgtcggcg 


tctcgcccga 


ccaccagggc 


aagggtctgg 


gcagcgccgt 


12360 


cgtgctcccc 


ggagtggagg 


cggccgagcg 


cgccggggtg 


cccgccttcc 


tggagacctc 


12420 


cgcgccccgc 


aacctcccct 


tctacgagcg 


gctcggcttc 


accgtcaccg 


ccgacgtcga 


12480 


ggtgcccgaa 


ggaccgcgca 


cctggtgcat 


gacccgcaag 


cccggtgcct 


gacgcccgcc 


12540 


ccacgacccg 


cagcgcccga 


ccgaaaggag 


cgcacgaccc 


catggctccg 


accgaagcca 


12600 


cccggggcgg 


ccccgccgac 


cccgcacccg 


cccccgaggc 


ccaccgactc 


tagctagagg 


12660 


atcataatca 


gccataccac 


atttgtagag 


gttttacttg 


ctttaaaaaa 


cctcccacac 


12720 


ctccccctga 


acctgaaaca 


taaaatgaat 


gcaattgttg 


ttgttaactt 


gtttattgca 


12780 


gcttataatg 


gttacaaata 


aagcaatagc 


atcacaaatt 


tcacaaataa 


agcatttttt 


12840 


tcactgcatt 


ctagttgtgg 


tttgtccaac 


gcgtcggtac 


cagcttttgt 


tccctttagt 


12900 


gagggttaat 


ttcgagcttg 


gcgtaatcat 


ggtcatagct 


gtttcctgtg 


tgaaattgtt 


12960 
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atccgctcac 


aattccacac 


aacatacgag 


ccggaagcat 


aaagtgtaaa 


gcctggggtg 


13020 


cctaatgagt 


gagctaactc 


acattaattg 


cgtttcgctc 


actgcccgct 


ttccagtcgg 


13080 


gaaacctgtc 


gtgccagctg 


cattaatgaa 


tcggccaacg 


cgcggggaga 


ggcggtttgc 


13140 


gtattgggcg 


ctcttccgct 


tcctcgctca 


ctgactcgct 


gcgctcggtc 


gttcggctgc 


13200 


ggcgagcggt 


atcagctcac 


tcaaaggcgg 


taatacggtt 


atccacagaa 


tcaggggata 


13260 


acgcaggaaa 


gaacatgtga 


gcaaaaggcc 


agcaaaaggc 


caggaaccgt 


aaaaaggccg 


13320 


cgttgctggc 


gtttttccat 


aggctccgcc 


cccctgacga 


gcatcacaaa 


aatcgacgct 


13380 


caagtcagag 


gtggcgaaac 


ccgacaggac 


tataaagata 


ccaggcgttt 


ccccctggaa 


13440 


gctccctcgt 


gcgctctcct 


gttccgaccc 


tgccgcttac 


cggatacctg 


tccgcctttc 


13500 


tcccttcggg 


aagcgtggcg 


ctttctcata 


gctcacgctg 


taggtatctc 


agttcggtgt 


13560 


aggtcgttcg 


ctccaagctg 


ggctgtgtgc 


acgaaccccc 


cgttcagccc 


gaccgctgcg 


13620 


ccttatccgg 


taactatcgt 


cttgagtcca 


acccggtaag 


acacgactta 


tcgccactgg 


13680 


cagcagccac 


tggtaacagg 


attagcagag 


cgaggtatgt 


aggcggtgct 


acagagttct 


13740 


tgaagtggtg 


gcctaactac 


ggctacacta 


gaagaacagt 


atttggtatc 


tgcgctctgc 


13800 


tgaagccagt 


taccttcgga 


aaaagagttg 


gtagctcttg 


atccggcaaa 


caaaccaccg 


13860 


ctggtagcgg 


tggttttttt 


gtttgcaagc 


agcagattac 


gcgcagaaaa 


aaaggatctc 


13920 


aagaagatcc 


tttgatcttt 


tctacggggt 


ctgacgctca 


gtggaacgaa 


aactcacgtt 


13980 


aagggatttt 


ggtcatgaga 


ttatcaaaaa 


ggatcttcac 


ctagatcctt 


ttaaattaaa 


14040 


aatgaagttt 


taaatcaatc 


taaagtatat 


atgagtaaac 


ttggtctgac 


agttaccaat 


14100 


gcttaatcag 


tgaggcacct 


atctcagcga 


tctgtctatt 


tcgttcatcc 


atagttgcct 


14160 


gactccccgt 


cgtgtagata 


actacgatac 


gggagggctt 


accatctggc 


cccagtgctg 


14220 


caatgatacc 


gcgagaccca 


cgctcaccgg 


ctccagattt 


atcagcaata 


aaccagccag 


14280 


ccggaagggc 


cgagcgcaga 


agtggtcctg 


caactttatc 


cgcctccatc 


cagtctatta 


14340 


attgttgccg 


ggaagctaga 


gtaagtagtt 


cgccagttaa 


tagtttgcgc 


aacgttgttg 


14400 


ccattgctac 


aggcatcgtg 


gtgtcacgct 


cgtcgtttgg 


tatggcttca 


ttcagctccg 


14460 


gttcccaacg 


atcaaggcga 


gttacatgat 


cccccatgtt 


gtgcaaaaaa 


gcggttagct 


14520 


ccttcggtcc 


tccgatcgtt 


gtcagaagta 


agttggccgc 


agtgttatca 


ctcatggtta 


14580 


tggcagcact 


gcataattct 


cttactgtca 


tgccatccgt 


aagatgcttt 


tctgtgactg 


14640 


gtgagtactc 


aaccaagtca 


ttctgagaat 


agtgtatgcg 


gcgaccgagt 


tgctcttgcc 


14700 


cggcgtcaat 


acgggataat 


accgcgccac 


atagcagaac 


tttaaaagtg 


ctcatcattg 


14760 


gaaaacgttc 


ttcggggcga 


aaactctcaa 


ggatcttacc 


gctgttgaga 


tccagttcga 


14820 


tgtaacccac 


tcgtgcaccc 


aactgatctt 


cagcatcttt 


tactttcacc 


agcgtttctg 


14880 
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ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat 14 940 

gttgaatact catactcttc ctttttcaat attattgaag catttatcag ggttattgtc 15000 

tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca 15060 

catttccccg aaaagtgcca c 15081 

<210> 22 

<211> 17753 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> cDNA 

<400> 22 



ctgacgcgcc 


ctgtagcggc 


gcattaagcg 


cggcgggtgt 


ggtggttacg 


cgcagcgtga 


60 


ccgctacact 


tgccagcgcc 


ctagcgcccg 


ctcctttcgc 


tttcttccct 


tcctttctcg 


120 


ccacgttcgc 


cggcggatcc 


ggccattagc 


catattattc 


attggttata 


tagcataaat 


180 


caatattggc 


tattggccat 


tgcatacgtt 


gtatccatat 


cataatatgt 


acatttatat 


240 


tggctcatgt 


ccaacattac 


cgccatgttg 


acattgatta 


ttgactagtt 


attaatagta 


300 


atcaattacg 


gggtcattag 


ttcatagccc 


atatatggag 


ttccgcgtta 


cataacttac 


360 


ggtaaatggc 


ccgcctggct 


gaccgcccaa 


cgacccccgc 


ccattgacgt 


caataatgac 


420 


gtatgttccc 


atagtaacgc 


caatagggac 


tttccattga 


cgtcaatggg 


tggagtattt 


480 


acggtaaact 


gcccacttgg 


cagtacatca 


agtgtatcat 


atgccaagta 


cgccccctat 


540 


tgacgtcaat 


gacggtaaat 


ggcccgcctg 


gcattatgcc 


cagtacatga 


ccttatggga 


600 


ctttcctact 


tggcagtaca 


tctacgtatt 


agtcatcgct 


attaccatgg 


tgatgcggtt 


660 


ttggcagtac 


atcaatgggc 


gtggatagcg 


gtttgactca 


cggggatttc 


caagtctcca 


720 


ccccattgac 


gtcaatggga 


gtttgttttg 


gcaccaaaat 


caacgggact 


ttccaaaatg 


780 


tcgtaacaac 


tccgccccat 


tgacgcaaat 


gggcggtagg 


cgtgtacggt 


gggaggtcta 


840 


tataagcaga 


gctcgtttag 


tgaaccgtat 


tgacggcgta 


gtacacacta 


ttgaatcaaa 


900 


cagccgacca 


attgcactac 


catcacaatg 


gagaagccag 


tagtaaacgt 


agacgtagac 


960 


ccccagagtc 


cgtttgtcgt 


gcaactgcaa 


aaaagcttcc 


cgcaatttga 


ggtagtagca 


1020 


cagcaggtca 


ctccaaatga 


ccatgctaat 


gccagagcat 


tttcgcatct 


ggccagtaaa 


1080 


ctaatcgagc 


tggaggttcc 


taccacagcg 


acgatcttgg 


acataggcag 


cgcaccggct 


1140 


cgtagaatgt 


tttccgagca 


ccagtatcat 


tgtgtctgcc 


ccatgcgtag 


tccagaagac 


1200 


ccggaccgca 


tgatgaaata 


tgccagtaaa 


ctggcggaaa 


aagcgtgcaa 


gattacaaac 


1260 


aagaacttgc 


atgagaagat 


taaggatctc 


cggaccgtac 


ttgatacgcc 


ggatgctgaa 


1320 



'■ 27 



acaccatcgc 


tctgctttca 


caacgatgtt 


acctgcaaca 


tgcgtgccga 


atattccgtc 


1380 


atgcaggacg 


tgtatatcaa 


cgctcccgga 


actatctatc 


atcaggctat 


gaaaggcgtg 


1440 


cggaccctgt 


actggattgg 


cttcgacacc 


acccagttca 


tgttctcggc 


tatggcaggt 


1500 


tcgtaccctg 


cgtacaacac 


caactgggcc 


gacgagaaag 


tccttgaagc 


gcgtaacatc 


1560 


ggactttgca 


gcacaaagct 


gagtgaaggt 


aggacaggaa 


aattgtcgat 


aatgaggaag 


1620 


aaggagttga 


agcccgggtc 


gcgggtttat 


ttctccgtag 


gatcgacact 


ttatccagaa 


1680 


cacagagcca 


gcttgcagag 


ctggcatctt 


ccatcggtgt 


tccacttgaa 


tggaaagcag 


1740 


tcgtacactt 


gccgctgtga 


tacagtggtg 


agttgcgaag 


gctacgtagt 


gaagaaaatc 


1800 


accatcagtc 


ccgggatcac 


gggagaaacc 


gtgggatacg 


cggttacaca 


caatagcgag 


1860 


ggcttcttgc 


tatgcaaagt 


tactgacaca 


gtaaaaggag 


aacgggtatc 


gttccctgtg 


1920 


tgcacgtaca 


tcccggccac 


catatgcgat 


cagatgactg 


gtataatggc 


cacggatata 


1980 


tcacctgacg 


atgcacaaaa 


acttctggtt 


gggctcaacc 


agcgaattgt 


cattaacggt 


2040 


aggactaaca 


ggaacaccaa 


caccatgcaa 


aattaccttc 


tgccgatcat 


agcacaaggg 


2100 


ttcagcaaat 


gggctaagga 


gcgcaaggat 


gatcttgata 


acgagaaaat 


gctgggtact 


2160 


agagaacgca 


agcttacgta 


tggctgcttg 


tgggcgtttc 


gcactaagaa 


agtacattcg 


2220 


ttttatcgcc 


cacctggaac 


gcagacctgc 


gtaaaagtcc 


cagcctcttt 


tagcgctttt 


2280 


cccatgtcgt 


ccgtatggac 


gacctctttg 


cccatgtcgc 


tgaggcagaa 


attgaaactg 


2340 


gcattgcaac 


caaagaagga 


ggaaaaactg 


ctgcaggtct 


cggaggaatt 


agtcatggag 


2400 


gccaaggctg 


cttttgagga 


tgctcaggag 


gaagccagag 


cggagaagct 


ccgagaagca 


2460 


cttccaccat 


tagtggcaga 


caaaggcatc 


gaggcagccg 


cagaagttgt 


ctgcgaagtg 


2520 


gaggggctcc 


aggcggacat 


cggagcagca 


ttagttgaaa 


ccccgcgcgg 


tcacgtaagg 


2580 


ataatacctc 


aagcaaatga 


ccgtatgatc 


ggacagtata 


tcgttgtctc 


gccaaactct 


2640 


gtgctgaaga 


atgccaaact 


cgcaccagcg 


cacccgctag 


cagatcaggt 


taagatcata 


2700 


acacactccg 


gaagatcagg 


aaggtacgcg 


gtcgaaccat 


acgacgctaa 


agtactgatg 


2760 


ccagcaggag 


gtgccgtacc 


atggccagaa 


ttcctagcac 


tgagtgagag 


cgccacgtta 


2820 


gtgtacaacg 


aaagagagtt 


tgtgaaccgc 


aaactatacc 


acattgccat 


gcatggcccc 


2880 


gccaagaata 


cagaagagga 


gcagtacaag 


gttacaaagg 


cagagcttgc 


agaaacagag 


2940 


tacgtgtttg 


acgtggacaa 


gaagcgttgc 


gttaagaagg 


aagaagcctc 


aggtctggtc 


3000 


ctctcgggag 


aactgaccaa 


ccctccctat 


catgagctag 


ctctggaggg 


actgaagacc 


3060 


cgacctgcgg 


tcccgtacaa 


ggtcgaaaca 


ataggagtga 


taggcacacc 


ggggtcgggc 


3120 


aagtcagcta 


ttatcaagtc 


aactgtcacg 


gcacgagatc 


ttgttaccag 


cggaaagaaa 


3180 


gaaaattgtc 


gcgaaattga 


ggccgacgtg 


ctaagactga 


ggggtatgca 


gattacgtcg 


3240 



aagacagtag 


attcggttat 


gctcaacgga 


tgccacaaag 


ccgtagaagt 


gctgtacgtt 


3300 


gacgaagcgt 


tcgcgtgcca 


cgcaggagca 


ctacttgcct 


tgattgctat 


cgtcaggccc 


3360 


cgcaagaagg 


tagtactatg 


cggagacccc 


atgcaatgcg 


gattcttcaa 


catgatgcaa 


3420 


ctaaaggtac 


atttcaatca 


ccctgaaaaa 


gacatatgca 


ccaagacatt 


ctacaagtat 


3480 


atctcccggc 


gttgcacaca 


gccagttaca 


gctattgtat 


cgacactgca 


ttacgatgga 


3540 


aagatgaaaa 


ccacgaaccc 


gtgcaagaag 


aacattgaaa 


tcgatattac 


aggggccaca 


3600 


aagccgaagc 


caggggatat 


catcctgaca 


tgtttccgcg 


ggtgggttaa 


gcaattgcaa 


3660 


atcgactatc 


ccggacatga 


agtaatgaca 


gccgcggcct 


cacaagggct 


aaccagaaaa 


3720 


ggagtgtatg 


ccgtccggca 


aaaagtcaat 


gaaaacccac 


tgtacgcgat 


cacatcagag 


3780 


catgtgaacg 


tgttgctcac 


ccgcactgag 


gacaggctag 


tgtggaaaac 


cttgcagggc 


3840 


gacccatgga 


ttaagcagct 


cactaacata 


cctaaaggaa 


actttcaggc 


tactatagag 


3900 


gactgggaag 


ctgaacacaa 


gggaataatt 


gctgcaataa 


acagccccac 


tccccgtgcc 


3960 


aatccgttca 


gctgcaagac 


caacgtttgc 


tgggcgaaag 


cattggaacc 


gatactagcc 


4020 


acggccggta 


tcgtacttac 


cggttgccag 


tggagcgaac 


tgttcccaca 


gtttgcggat 


4080 


gacaaaccac 


attcggccat 


ttacgcctta 


gacgtaattt 


gcattaagtt 


tttcggcatg 


4140 


gacttgacaa 


gcggactgtt 


ttctaaacag 


agcatcccac 


taacgtacca 


tcccgccgat 


4200 


tcagcgaggc 


cggtagctca 


ttgggacaac 


agcccaggaa 


cccgcaagta 


tgggtacgat 


4260 


cacgccattg 


ccgccgaact 


ctcccgtaga 


tttccggtgt 


tccagctagc 


tgggaagggc 


4320 


acacaacttg 


atttgcagac 


ggggagaacc 


agagttatct 


ctgcacagca 


taacctggtc 


4380 


ccggtgaacc 


gcaatcttcc 


tcacgcctta 


gtccccgagt 


acaaggagaa 


gcaacccggc 


4440 


ccggtcgaaa 


aattcttgaa 


ccagttcaaa 


caccactcag 


tacttgtggt 


atcagaggaa 


4500 


aaaattgaag 


ctccccgtaa 


gagaatcgaa 


tggatcgccc 


cgattggcat 


agccggtgca 


4560 


gataagaact 


acaacctggc 


tttcgggttt 


ccgccgcagg 


cacggtacga 


cctggtgttc 


4620 


atcaacattg 


gaactaaata 


cagaaaccac 


cactttcagc 


agtgcgaaga 


ccatgcggcg 


4 680 


accttaaaaa 


ccctttcgcg 


ttcggccctg 


aattgtttaa 


actcaggagg 


caccctcgtg 


4740 


gtgaagtcct 


atggctacgc 


cgaccgcaac 


agtgaggacg 


tagtcaccgc 


tcttgccaga 


4800 


aagtttgtca 


gggtgtctgc 


agcgagacca 


gattgtgtct 


caagcaatac 


agaaatgtac 


4 860 


ctgattttcc 


gacaactaga 


caacagccgt 


acacggcaat 


tcaccccgca 


ccatctgaat 


4920 


tgcgtgattt 


cgtccgtgta 


tgagggtaca 


agagatggag 


ttggagccgc 


gccgtcatac 


4980 


cgcaccaaaa 


gggagaatat 


tgctgactgt 


caagaggaag 


cagttgtcaa 


cgcagccaat 


5040 


ccgctgggta 


gaccaggcga 


aggagtctgc 


cgtgccatct 


ataaacgttg 


gccgaccagt 


5100 
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tttaccgatt 


cagccacgga 


gacaggcacc 


gcaagaatga 


ctgtgtgcct 


aggaaagaaa 


5160 


gtgatccacg 


cggtcggccc 


tgatttccgg 


aagcacccag 


aagcagaagc 


cttgaaattg 


5220 


ctacaaaacg 


cctaccatgc 


agtggcagac 


ttagtaaatg 


aacataacat 


caagtctgtc 


5280 


gccattccac 


tgctatctac 


aggcatttac 


gcagccggaa 


aagaccgcct 


tgaagtatca 


5340 


cttaactgct 


tgacaaccgc 


gctagacaga 


actgacgcgg 


acgtaaccat 


ctattgcctg 


5400 


gataagaagt 


ggaaggaaag 


aatcgacgcg 


gcactccaac 


ttaaggagtc 


tgtaacagag 


5460 


ctgaaggatg 


aagatatgga 


gatcgacgat 


gagttagtat 


ggattcatcc 


agacagttgc 


5520 


ttgaagggaa 


gaaagggatt 


cagtactaca 


aaaggaaaat 


tgtattcgta 


cttcgaaggc 


5580 


accaaattcc 


atcaagcagc 


aaaagacatg 


gcggagataa 


aggtcctgtt 


ccctaatgac 


5640 


caggaaagta 


atgaacaact 


gtgtgcctac 


atattgggtg 


agaccatgga 


agcaatccgc 


5700 


gaaaagtgcc 


cggtcgacca 


taacccgtcg 


tctagcccgc 


ccaaaacgtt 


gccgtgcctt 


5760 


tgcatgtatg 


ccatgacgcc 


agaaagggtc 


cacagactta 


gaagcaataa 


cgtcaaagaa 


5820 


gttacagtat 


gctcctccac 


cccccttcct 


aagcacaaaa 


ttaagaatgt 


tcagaaggtt 


5880 


cagtgcacga 


aagtagtcct 


gtttaatccg 


cacactcccg 


cattcgttcc 


cgcccgtaag 


5940 


tacatagaag 


tgccagaaca 


gcctaccgct 


cctcctgcac 


aggccgagga 


ggcccccgaa 


6000 


gttgtagcga 


caccgtcacc 


atctacagct 


gataacacct 


cgcttgatgt 


cacagacatc 


6060 


tcactggata 


tggatgacag 


tagcgaaggc 


tcactttttt 


cgagctttag 


cggatcggac 


6120 


aactctatta 


ctagtatgga 


cagttggtcg 


tcaggaccta 


gttcactaga 


gatagtagac 


6180 


cgaaggcagg 


tggtggtggc 


tgacgttcat 


gccgtccaag 


agcctgcccc 


tattccaccg 


6240 


ccaaggctaa 


agaagatggc 


ccgcctggca 


gcggcaagaa 


aagagcccac 


tccaccggca 


6300 


agcaatagct 


ctgagtccct 


ccacctctct 


tttggtgggg 


tatccatgtc 


cctcggatca 


6360 


attttcgacg 


gagagacggc 


ccgccaggca 


gcggtacaac 


ccctggcaac 


aggccccacg 


6420 


gatgtgccta 


tgtctttcgg 


atcgttttcc 


gacggagaga 


ttgatgagct 


gagccgcaga 


6480 


gtaactgagt 


ccgaacccgt 


cctgtttgga 


tcatttgaac 


cgggcgaagt 


gaactcaatt 


6540 


atatcgtccc 


gatcagccgt 


atcttttcct 


ctacgcaagc 


agagacgtag 


acgcaggagc 


6600 


aggaggactg 


aatactgact 


aaccggggta 


ggtgggtaca 


tattttcgac 


ggacacaggc 


6660 


cctgggcact 


tgcaaaagaa 


gtccgttctg 


cagaaccagc 


ttacagaacc 


gaccttggag 


6720 


cgcaatgtcc 


tggaaagaat 


tcatgccccg 


gtgctcgaca 


cgtcgaaaga 


ggaacaactc 


6780 


aaactcaggt 


accagatgat 


gcccaccgaa 


gccaacaaaa 


gtaggtacca 


gtctcgtaaa 


6840 


gtagaaaatc 


agaaagccat 


aaccactgag 


cgactactgt 


caggactacg 


actgtataac 


6900 


tctgccacag 


atcagccaga 


atgctataag 


atcacctatc 


cgaaaccatt 


gtactccagt 


6960 


agcgtaccgg 


cgaactactc 


cgatccacag 


ttcgctgtag 


ctgtctgtaa 


caactatctg 


7020 
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catgagaact 


atccgacagt 


agcatcttat 


cagattactg 


acgagtacga 


tgcttacttg 


7080 


gatatggtag 


acgagacagt 


cgcatgcctg 


gatactgcaa 


ccttctgccc 


cgctaagctt 


7140 


agaagttacc 


cgaaaaaaca 


tgagtataga 


gccccgaata 


tccgcagtgc 


ggttccatca 


7200 


gcgatgcaga 


acacgctaca 


aaatgtgctc 


attgccgcaa 


ctaaaagaaa 


ttgcaacgtc 


7260 


acgcagatgc 


gtgaactgcc 


aacactggac 


tcagcgacat 


tcaatgtcga 


atgctttcga 


7320 


aaatatgcat 


gtaatgacga 


gtattgggag 


gagttcgctc 


ggaagccaat 


taggattacc 


7380 


actgagtttg 


tcaccgcata 


tgtagctaga 


ctgaaaggcc 


ctaaggccgc 


cgcactattt 


7440 


gcaaagacgt 


ataatttggt 


cccattgcaa 


gaagtgccta 


tggatagatt 


cgtcatggac 


7500 


atgaaaagag 


acgtgaaagt 


tacaccaggc 


acgaaacaca 


cagaagaaag 


accgaaagta 


7560 


caagtgatac 


aagccgcaga 


acccctggcg 


actgcttact 


tatgcgggat 


tcaccgggaa 


7620 


ttagtgcgta 


ggcttacggc 


cgtcttgctt 


ccaaacattc 


acacgctttt 


tgacatgtcg 


7680 


gcggaggatt 


ttgatgcaat 


catagcagaa 


cacttcaagc 


aaggcgaccc 


ggtactggag 


7740 


acggatatcg 


catcattcga 


caaaagccaa 


gacgacgcta 


tggcgttaac 


cggtctgatg 


7800 


atcttggagg 


acctgggtgt 


ggatcaacca 


ctactcgact 


tgatcgagtg 


cgcctttgga 


7860 


gaaatatcat 


ccacccatct 


acctacgggt 


actcgtttta 


aattcggggc 


gatgatgaaa 


7920 


tccggaatgt 


tcctcacact 


ttttgtcaac 


acagttttga 


atgtcgttat 


cgccagcaga 


7980 


gtactagaag 


agcggcttaa 


aacgtccaga 


tgtgcagcgt 


tcattggcga 


cgacaacatc 


8040 


atacatggag 


tagtatctga 


caaagaaatg 


gctgagaggt 


gcgccacctg 


gctcaacatg 


8100 


gaggttaaga 


tcatcgacgc 


agtcatcggt 


gagagaccac 


cttacttctg 


cggcggattt 


8160 


atcttgcaag 


attcggttac 


ttccacagcg 


tgccgcgtgg 


cggatcccct 


gaaaaggctg 


8220 


tttaagttgg 


gtaaaccgct 


cccagccgac 


gacgagcaag 


acgaagacag 


aagacgcgct 


8280 


ctgctagatg 


aaacaaaggc 


gtggtttaga 


gtaggtataa 


caggcacttt 


agcagtggcc 


8340 


gtgacgaccc 


ggtatgaggt 


agacaatatt 


acacctgtcc 


tactggcatt 


gagaactttt 


8400 


gcccagagca 


aaagagcatt 


ccaagccatc 


agaggggaaa 


taaagcatct 


ctacggtggt 


84 60 


cctaaatagt 


cagcatagta 


catttcatct 


gactaatacc 


acaacaccac 


cacctctaga 


8520 


ttaattaact 


cgaggcgcgc 


cgggcccaga 


ggatctttgt 


gaaggaacct 


tacttctgtg 


8580 


gtgtgacata 


attggacaaa 


ctacctacag 


agatttaaag 


ctctaaggta 


aatataaaat 


8640 


ttttaagtgt 


ataatgtgtt 


aaactactga 


ttctaattgt 


ttgtgtattt 


tagattccaa 


8700 


cctatggaac 


tgatgaatgg 


gagcagtggt 


ggaatgcctt 


taatgaggaa 


aacctgtttt 


8760 


gctcagaaga 


aatgccatct 


agtgatgatg 


aggctactgc 


tgactctcaa 


cattctactc 


8820 


ctccaaaaaa 


gaagagaaag 


gtagaagacc 


ccaaggactt 


tccttcagaa 


ttgctaagtt 


8880 
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ttttgagtca 


tgctgtgttt 


agtaatagaa 


ctcttgcttg 


ctttgctatt 


tacaccacaa 


8940 


aggaaaaagc 


tgcactgcta 


tacaagaaaa 


ttatggaaaa 


atatttgatg 


tagcggccca 


9000 


atgatccgac 


cagcaaaact 


cgatgtactt 


ccgaggaact 


gatgtgcata 


atgcatcagg 


9060 


ctggtacatt 


agatccccgc 


ttaccgcggg 


caatatagca 


acactaaaaa 


ctcgatgtac 


9120 


ttccgaggaa 


gcgcagtgca 


taatgctgcg 


cagtgttgcc 


acataaccac 


tatattaacc 


9180 


atttatctag 


cggacgccaa 


aaactcaatg 


tatttctgag 


gaagcgtggt 


gcataatgcc 


9240 


acgcagcgtc 


tgcataactt 


ttattatttc 


ttttattaat 


caacaaaatt 


ttgtttttaa 


9300 


catttcaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaagg 


ggaattccca 


acttgtttat 


9360 


tgcagcttat 


aatggttaca 


aataaagcaa 


tagcatcaca 


aatttcacaa 


ataaagcatt 


9420 


tttttcactg 


cattctagtt 


gtggtttgtc 


caaactcatc 


aatgtatctt 


atcatgtctg 


9480 


gatccgtcga 


gacgcgtgcg 


gccgcgggaa 


ttcgatgtgg 


ctttcatcct 


ggagcagact 


9540 


ttgcagtctg 


tggactgcaa 


cacaacattg 


cctttatgtg 


taactcttgg 


ctgaagctct 


9600 


tacaccaatg 


ctgggggaca 


tgtacctccc 


aggggcccag 


gaagactacg 


ggaggctaca 


9660 


ccaacgtcaa 


tcagaggggc 


ctgtgtagct 


accgataagc 


ggaccctcaa 


gagggcatta 


9720 


gcaatagtgt 


ttataaggcc 


cccttgttaa 


ccctaaacgg 


gtagcatatg 


cttcccgggt 


9780 


agtagtatat 


actatccaga 


ctaaccctaa 


ttcaatagca 


tatgttaccc 


aacgggaagc 


9840 


atatgctatc 


gaattagggt 


tagtaaaagg 


gtcctaagga 


acagcgatat 


ctcccacccc 


9900 


atgagctgtc 


acggttttat 


ttacatgggg 


tcaggattcc 


acgagggtag 


tgaaccattt 


9960 


tagtcacaag 


ggcagtggct 


gaagatcaag 


gagcgggcag 


tgaactctcc 


tgaatcttcg 


10020 


cctgcttctt 


cattctcctt 


cgtttagcta 


atagaataac 


tgctgagttg 


tgaacagtaa 


10080 


ggtgtatgtg 


aggtgctcga 


aaacaaggtt 


tcaggtgacg 


cccccagaat 


aaaatttgga 


10140 


cggggggttc 


agtggtggca 


ttgtgctatg 


acaccaatat 


aaccctcaca 


aaccccttgg 


10200 


gcaataaata 


ctagtgtagg 


aatgaaacat 


tctgaatatc 


tttaacaata 


gaaatccatg 


10260 


gggtggggac 


aagccgtaaa 


gactggatgt 


ccatctcaca 


cgaatttatg 


gctatgggca 


10320 


acacataatc 


ctagtgcaat 


atgatactgg 


ggttattaag 


atgtgtccca 


ggcagggacc 


10380 


aagacaggtg 


aaccatgttg 


ttacactcta 


tttgtaacaa 


ggggaaagag 


agtggacgcc 


10440 


gacagcagcg 


gactccactg 


gttgtctcta 


acacccccga 


aaattaaacg 


gggctccacg 


10500 


ccaatggggc 


ccataaacaa 


agacaagtgg 


ccactctttt 


ttttgaaatt 


gtggagtggg 


10560 


ggcacgcgtc 


agcccccaca 


cgccgccctg 


cggttttgga 


ctgtaaaata 


agggtgtaat 


10620 


aacttggctg 


attgtaaccc 


cgctaaccac 


tgcggtcaaa 


ccacttgccc 


acaaaaccac 


10680 


taatggcacc 


ccggggaata 


cctgcataag 


taggtgggcg 


ggccaagata 


ggggcgcgat 


10740 


tgctgcgatc 


tggaggacaa 


attacacaca 


cttgcgcctg 


agcgccaagc 


acagggttgt 


10800 



tggtcctcat 


attcacgagg 


tcgctgagag 


cacggtgggc 


taatgttgcc 


atgggtagca 


10860 


tatactaccc 


aaatatctgg 


atagcatatg 


ctatcctaat 


ctatatctgg 


gtagcatagg 


10920 


ctatcctaat 


ctatatctgg 


gtagcatatg 


ctatcctaat 


ctatatctgg 


gtagtatatg 


10980 


ctatcctaat 


ttatatctgg 


gtagcatagg 


ctatcctaat 


ctatatctgg 


gtagcatatg 


11040 


ctatcctaat 


ctatatctgg 


gtagtatatg 


ctatcctaat 


ctgtatccgg 


gtagcatatg 


11100 


ctatcctaat 


agagattagg 


gtagtatatg 


ctatcctaat 


ttatatctgg 


gtagcatata 


11160 


ctacccaaat 


atctggatag 


catatgctat 


cctaatctat 


atctgggtag 


catatgctat 


11220 


cctaatctat 


atctgggtag 


cataggctat 


cctaatctat 


atctgggtag 


catatgctat 


11280 


cctaatctat 


atctgggtag 


tatatgctat 


cctaatttat 


atctgggtag 


cataggctat 


11340 


cctaatctat 


atctgggtag 


catatgctat 


cctaatctat 


atctgggtag 


tatatgctat 


11400 


cctaatctgt 


atccgggtag 


catatgctat 


cctcatgcat 


atacagtcag 


catatgatac 


11460 


ccagtagtag 


agtgggagtg 


ctatcctttg 


catatgccgc 


cacctcccaa 


gggggcgtga 


11520 


attttcgctg 


cttgtccttt 


tcctgcatgc 


tggttgctcc 


cattcttagg 


tgaatttaag 


11580 


gaggccaggc 


taaagccgtc 


gcatgtctga 


ttgctcacca 


ggtaaatgtc 


gctaatgttt 


11640 


tccaacgcga 


gaaggtgttg 


agcgcggagc 


tgagtgacgt 


gacaacatgg 


gtatgcccaa 


11700 


ttgccccatg 


ttgggaggac 


gaaaatggtg 


acaagacaga 


tggccagaaa 


tacaccaaca 


11760 


gcacgcatga 


tgtctactgg 


ggatttattc 


tttagtgcgg 


gggaatacac 


ggcttttaat 


11820 


acgattgagg 


gcgtctccta 


acaagttaca 


tcactcctgc 


ccttcctcac 


cctcatctcc 


11880 


atcacctcct 


tcatctccgt 


catctccgtc 


atcaccctcc 


gcggcagccc 


cttccaccat 


1194 0 


aggtggaaac 


cagggaggca 


aatctactcc 


atcgtcaaag 


ctgcacacag 


tcaccctgat 


12000 


attgcaggta 


ggagcgggct 


ttgtcataac 


aaggtcctta 


atcgcatcct 


tcaaaacctc 


12060 


agcaaatata 


tgagtttgta 


aaaagaccat 


gaaataacag 


acaatggact 


cccttagcgg 


12120 


gccaggttgt 


gggccgggtc 


caggggccat 


tccaaagggg 


agacgactca 


atggtgtaag 


12180 


acgacattgt 


ggaatagcaa 


gggcagttcc 


tcgccttagg 


ttgtaaaggg 


aggtcttact 


12240 


acctccatat 


acgaacacac 


cggcgaccca 


agttccttcg 


tcggtagtcc 


tttctacgtg 


12300 


actcctagcc 


aggagagctc 


ttaaaccttc 


tgcaatgttc 


tcaaatttcg 


ggttggaacc 


12360 


tccttgacca 


cgatgctttc 


caaaccaccc 


tccttttttg 


cgcctgcctc 


catcaccctg 


12420 


accccggggt 


ccagtgcttg 


ggccttctcc 


tgggtcatct 


gcggggccct 


gctctatcgc 


12480 


tcccgggggc 


acgtcaggct 


caccatctgg 


gccaccttct 


tggtggtatt 


caaaataatc 


12540 


ggcttcccct 


acagggtgga 


aaaatggcct 


tctacctgga 


gggggcctgc 


gcggtggaga 


12600 


cccggatgat 


gatgactgac 


tactgggact 


cctgggcctc 


ttttctccac 


gtccacgacc 


12660 
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tctccccctg 


gctctttcac 


gacttccccc 


cctggctctt 


tcacgtcctc 


taccccggcg 


12720 


gcctccacta 


cctcctcgac 


cccggcctcc 


actacctcct 


cgaccccggc 


ctccactgcc 


12780 


tcctcgaccc 


cggcctccac 


ctcctgctcc 


tgcccctcct 


gctcctgccc 


ctcctcctgc 


12840 


tcctgcccct 


cctgcccctc 


ctgctcctgc 


ccctcctgcc 


cctcctgctc 


ctgcccctcc 


12900 


tgcccctcct 


gctcctgccc 


ctcctgcccc 


tcctcctgct 


cctgcccctc 


ctgcccctcc 


12960 


tcctgctcct 


gcccctcctg 


cccctcctgc 


tcctgcccct 


cctgcccctc 


ctgctcctgc 


13020 


ccctcctgcc 


cctcctgctc 


ctgcccctcc 


tgctcctgcc 


cctcctgctc 


ctgcccctcc 


13080 


tgctcctgcc 


cctcctgccc 


ctcctgcccc 


tcctcctgct 


cctgcccctc 


ctgctcctgc 


13140 


ccctcctgcc 


cctcctgccc 


ctcctgctcc 


tgcccctcct 


cctgctcctg 


cccctcctgc 


13200 


ccctcctgcc 


cctcctcctg 


ctcctgcccc 


tcctgcccct 


cctcctgctc 


ctgcccctcc 


13260 


tcctgctcct 


gcccctcctg 


cccctcctgc 


ccctcctcct 


gctcctgccc 


ctcctgcccc 


13320 


tcctcctgct 


cctgcccctc 


ctcctgctcc 


tgcccctcct 


gcccctcctg 


cccctcctcc 


13380 


tgctcctgcc 


cctcctcctg 


ctcctgcccc 


tcctgcccct 


cctgcccctc 


ctgcccctcc 


13440 


tcctgctcct 


gcccctcctc 


ctgctcctgc 


ccctcctgct 


cctgcccctc 


ccgctcctgc 


13500 


tcctgctcct 


gttccaccgt 


gggtcccttt 


gcagccaatg 


caacttggac 


gtttttgggg 


13560 


tctccggaca 


ccatctctat 


gtcttggccc 


tgatcctgag 


ccgcccgggg 


ctcctggtct 


13620 


tccgcctcct 


cgtcctcgtc 


ctcttccccg 


tcctcgtcca 


tggttatcac 


cccctcttct 


13680 


ttgaggtcca 


ctgccgccgg 


agccttctgg 


tccagatgtg 


tctcccttct 


ctcctaggcc 


13740 


atttccaggt 


cctgtacctg 


gcccctcgtc 


agacatgatt 


cacactaaaa 


gagatcaata 


13800 


gacatcttta 


ttagacgacg 


ctcagtgaat 


acagggagtg 


cagactcctg 


ccccctccaa 


13860 


cagccccccc 


accctcatcc 


ccttcatggt 


cgctgtcaga 


cagatccagg 


tctgaaaatt 


13920 


ccccatcctc 


cgaaccatcc 


tcgtcctcat 


caccaattac 


tcgcagcccg 


gaaaactccc 


13980 


gctgaacatc 


ctcaagattt 


gcgtcctgag 


cctcaagcca 


ggcctcaaat 


tcctcgtccc 


14040 


cctttttgct 


ggacggtagg 


gatggggatt 


ctcgggaccc 


ctcctcttcc 


tcttcaaggt 


14100 


caccagacag 


agatgctact 


ggggcaacgg 


aagaaaagct 


gggtgcggcc 


tgtgaggatc 


14160 


agcttatatc 


actagtgaat 


tcgcggccgc 


gttagggtgt 


ggaaagtccc 


caggctcccc 


14220 


agcaggcaga 


agtatgcaaa 


gcatgcatct 


caattagtca 


gcaaccaggt 


gtggaaagtc 


14280 


cccaggctcc 


ccagcaggca 


gaagtatgca 


aagcatgcat 


ctcaattagt 


cagcaaccat 


14340 


agtcccgccc 


ctaactccgc 


ccatcccgcc 


cctaactccg 


cccagttccg 


cccattctcc 


14400 


gccccatggc 


tgactaattt 


tttttattta 


tgcagaggcc 


gaggccgcct 


cggcctctga 


14460 


gctattccag 


aagtagtgag 


gaggcttttt 


tggaggccta 


ggcttttgca 


aaaagcttgc 


14520 


atgcctgcag 


gtcggccgcc 


acgaccggtg 


ccgccaccat 


cccctgaccc 


acgcccctga 


14580 
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cccctcacaa 


ggagacgacc 


ttccatgacc 


gagtacaagc 


ccacggtgcg 


cctcgccacc 


14 64 0 


cgcgacgacg 


tcccccgggc 


cgtacgcacc 


ctcgccgccg 


cgttcgccga 


ctaccccgcc 


14700 


acgcgccaca 


ccgtcgaccc 


ggaccgccac 


atcgagcggg 


tcaccgagct 


gcaagaactc 


147 60 


ttcctcacgc 


gcgtcgggct 


cgacatcggc 


aaggtgtggg 


tcgcggacga 


cggcgccgcg 


14 820 


gtggcggtct 


ggaccacgcc 


ggagagcgtc 


gaagcggggg 


cggtgttcgc 


cgagatcggc 


14880 


ccgcgcatgg 


ccgagttgag 


cggttcccgg 


ctggccgcgc 


agcaacagat 


ggaaggcctc 


14 940 


ctggcgccgc 


accggcccaa 


ggagcccgcg 


tggttcctgg 


ccaccgtcgg 


cgtctcgccc 


15000 


gaccaccagg 


gcaagggtct 


gggcagcgcc 


gtcgtgctcc 


ccggagtgga 


ggcggccgag 


15060 


cgcgccgggg 


tgcccgcctt 


cctggagacc 


tccgcgcccc 


gcaacctccc 


cttctacgag 


15120 


cggctcggct 


tcaccgtcac 


cgccgacgtc 


gaggtgcccg 


aaggaccgcg 


cacctggtgc 


15180 


atgacccgca 


agcccggtgc 


ctgacgcccg 


ccccacgacc 


cgcagcgccc 


gaccgaaagg 


15240 


agcgcacgac 


cccatggctc 


cgaccgaagc 


cacccggggc 


ggccccgccg 


accccgcacc 


15300 


cgcccccgag 


gcccaccgac 


tctagctaga 


ggatcataat 


cagccatacc 


acatttgtag 


15360 


aggttttact 


tgctttaaaa 


aacctcccac 


acctccccct 


gaacctgaaa 


cataaaatga 


15420 


atgcaattgt 


tgttgttaac 


ttgtttattg 


cagcttataa 


tggttacaaa 


taaagcaata 


15480 


gcatcacaaa 


tttcacaaat 


aaagcatttt 


tttcactgca 


ttctagttgt 


ggtttgtcca 


15540 


acgcgtcggt 


accagctttt 


gttcccttta 


gtgagggtta 


atttcgagct 


tggcgtaatc 


15600 


atggtcatag 


ctgtttcctg 


tgtgaaattg 


ttatccgctc 


acaattccac 


acaacatacg 


15660 


agccggaagc 


ataaagtgta 


aagcctgggg 


tgcctaatga 


gtgagctaac 


tcacattaat 


15720 


tgcgtttcgc 


tcactgcccg 


ctttccagtc 


gggaaacctg 


tcgtgccagc 


tgcattaatg 


15780 


aatcggccaa 


cgcgcgggga 


gaggcggttt 


gcgtattggg 


cgctcttccg 


cttcctcgct 


15840 


cactgactcg 


ctgcgctcgg 


tcgttcggct 


gcggcgagcg 


gtatcagctc 


actcaaaggc 


15900 


ggtaatacgg 


ttatccacag 


aatcagggga 


taacgcagga 


aagaacatgt 


gagcaaaagg 


15960 


ccagcaaaag 


gccaggaacc 


gtaaaaaggc 


cgcgttgctg 


gcgtttttcc 


ataggctccg 


16020 


cccccctgac 


gagcatcaca 


aaaatcgacg 


ctcaagtcag 


aggtggcgaa 


acccgacagg 


16080 


actataaaga 


taccaggcgt 


ttccccctgg 


aagctccctc 


gtgcgctctc 


ctgttccgac 


1614 0 


cctgccgctt 


accggatacc 


tgtccgcctt 


tctcccttcg 


ggaagcgtgg 


cgctttctca 
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16200 


tagctcacgc 


tgtaggtatc 


tcagttcggt 


gtaggtcgtt 


cgctccaagc 


tgggctgtgt 


16260 


gcacgaaccc 


cccgttcagc 


ccgaccgctg 


cgccttatcc 


ggtaactatc 


gtcttgagtc 


16320 


caacccggta 


agacacgact 


tatcgccact 


ggcagcagcc 


actggtaaca 


ggattagcag 


16380 


agcgaggtat 


gtaggcggtg 


ctacagagtt 


cttgaagtgg 


tggcctaact 


acggctacac 


16440 




tagaagaaca 


gtatttggta 


tctgcgctct 


gctgaagcca 


gttaccttcg 


gaaaaagagt 


16500 


tggtagctct 


tgatccggca 


aacaaaccac 


cgctggtagc 


ggtggttttt 


ttgtttgcaa 


16560 


gcagcagatt 


acgcgcagaa 


aaaaaggatc 


tcaagaagat 


cctttgatct 


tttctacggg 


16620 


gtctgacgct 


cagtggaacg 


aaaactcacg 


ttaagggatt 


ttggtcatga 


gattatcaaa 


16680 


aaggatcttc 


acctagatcc 


ttttaaatta 


aaaatgaagt 


tttaaatcaa 


tctaaagtat 


16740 


atatgagtaa 


acttggtctg 


acagttacca 


atgcttaatc 


agtgaggcac 


ctatctcagc 


16800 


gatctgtcta 


tttcgttcat 


ccatagttgc 


ctgactcccc 


gtcgtgtaga 


taactacgat 


16860 


acgggagggc 


ttaccatctg 


gccccagtgc 


tgcaatgata 


ccgcgagacc 


cacgctcacc 


16920 


ggctccagat 


ttatcagcaa 


taaaccagcc 


agccggaagg 


gccgagcgca 


gaagtggtcc 


16980 


tgcaacttta 


tccgcctcca 


tccagtctat 


taattgttgc 


cgggaagcta 


gagtaagtag 


17040 


ttcgccagtt 


aatagtttgc 


gcaacgttgt 


tgccattgct 


acaggcatcg 


tggtgtcacg 


17100 


ctcgtcgttt 


ggtatggctt 


cattcagctc 


cggttcccaa 


cgatcaaggc 


gagttacatg 


17160 


atcccccatg 


ttgtgcaaaa 


aagcggttag 


ctccttcggt 


cctccgatcg 


ttgtcagaag 


17220 


taagttggcc 


gcagtgttat 


cactcatggt 


tatggcagca 


ctgcataatt 


ctcttactgt 


17280 


catgccatcc 


gtaagatgct 


tttctgtgac 


tggtgagtac 


tcaaccaagt 


cattctgaga 


17340 


atagtgtatg 


cggcgaccga 


gttgctcttg 


cccggcgtca 


atacgggata 


ataccgcgcc 


17400 


acatagcaga 


actttaaaag 


tgctcatcat 


tggaaaacgt 


tcttcggggc 


gaaaactctc 


17460 


aaggatctta 


ccgctgttga 


gatccagttc 


gatgtaaccc 


actcgtgcac 


ccaactgatc 


17520 


ttcagcatct 


tttactttca 


ccagcgtttc 


tgggtgagca 


aaaacaggaa 


ggcaaaatgc 


17580 


cgcaaaaaag 


ggaataaggg 


cgacacggaa 


atgttgaata 


ctcatactct 


tcctttttca 


17640 


atattattga 


agcatttatc 


agggttattg 


tctcatgagc 


ggatacatat 


ttgaatgtat 


17700 


ttagaaaaat 


aaacaaatag 


gggttccgcg 


cacatttccc 


cgaaaagtgc 


cac 


17753 
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<223> primer 
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gacttctaga ctggggcggg gtgggatgaa c 
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<210> 25 

<211> 33 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> primer 

<400> 25 

gtcatgaagc ttctcggcga ccagaacacc ttc 33 
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<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
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gacttctaga ctggggcggg gtgggatgaa c 
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